AFRL-OSR-VA-TR-20 13-0629 


ESTIMATION  AND  CONTROL  FOR  LINEAR  SYSTEMS  WITH 
ADDITIVE  CAUCHY  NOISE 


JASON  SPEYER 

REGENTS  OF  THE  UNIV  OF  CALIFORNIA 

12/17/2013 
Final  Report 


DISTRIBUTION  A:  Distribution  approved  for  public  release. 


AIR  FORCE  RESEARCH  LABORATORY 
AF  OFFICE  OF  SCIENTIFIC  RESEARCH  (AFOSR)/RSL 
ARLINGTON,  VIRGINIA  22203 
AIR  FORCE  MATERIEL  COMMAND 


file:/A\52zhtv-fs-725v\CStemp\adlib\input\wr_export_13 1223 134732_946805803...  12/23/2013 


REPORT  DOCUMENTATION  PAGE 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  this  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and  maintaining  the 
data  needed,  and  completing  and  reviewing  this  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information,  including  suggestions  for  reducing 
this  burden  to  Department  of  Defense,  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports  (0704-0188),  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington,  VA  22202- 
4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  any  penalty  for  failing  to  comply  with  a  collection  of  information  if  it  does  not  display  a  currently 
valid  OMB  control  number.  PLEASE  DO  NOT  RETURN  YOUR  FORM  TO  THE  ABOVE  ADDRESS. 

1 .  REPORT  DATE  (DD-MM-YYYY)  2.  REPORT  TYPE 

12-17-2013  FINAL 

3.  DATES  COVERED  (From  -  To) 

9/30/2010-9/29/2013 

4.  TITLE  AND  SUBTITLE 

ESTIMATION  AND  CONTROL  FOR  LINEAR  SYSTEMS  WITH  ADDITIVE  CAUCHY 
NOISE 

5a.  CONTRACT  NUMBER 

FA9550-1 0-1 -0570 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

Jason  L  Speyer 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

University  of  California,  Los  Angeles 

8.  PERFORMING  ORGANIZATION  REPORT 
NUMBER 

9.  SPONSORING  /  MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

USAF/  Office  of  Scientific  Research 

Elaine  A  Ubas  7035888391 

CODE  FA9550 

10.  SPONSOR/MONITOR’S  ACRONYM(S) 

USAF 

11.  SPONSOR/MONITOR’S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION  /  AVAILABILITY  STATEMENT 

Distribution  A  -  Approved  for  Public  Release 

13.  SUPPLEMENTARY  NOTES 

14.  ABSTRACT 

A  new  class  of  scalar  and  vector-state  estimators  and  stochastic  controllers  for  linear  dynamic  systems  with  additive  Cauchy 
process  and  measurement  noises  has  been  developed.  The  Kalman  filter  and  the  linear-quadratic-Gaussian  controller  have 
been  the  main  estimation  and  control 

paradigms  in  modern  engineering.  However,  many  practical  system  noises,  such  as  radar  glint,  are  better  described  by  heavy 
tailed  probability  density  functions  (pdf).  Although  the  Cauchy  pdf  has  an  infinite  variance,  the  conditional  density  of  a  Cauchy 
random  variable,  given  a  linear  measurement  with  an  additive  Cauchy  noise,  has  a  conditional  mean  and  a  finite  conditional 
variance,  both  being  functions  of  the  measurement. 

15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION 

OF  ABSTRACT 

18.  NUMBER 
OF  PAGES 

19a.  NAME  OF  RESPONSIBLE  PERSON 

Jason  Speyer 

a.  REPORT 

b.  ABSTRACT 

c.  THIS  PAGE 

19b.  TELEPHONE  NUMBER  (include  area 
code) 

310-206-4451 

Standard  Form  298  (Rev.  8-98) 


Prescribed  by  ANSI  Std.  Z39.18 


AFOSR  Grant  #  FA9550-1 0-1 -0570 
Estimation  and  Control  for  Linear  Systems  with  Additive 

Cauchy  Noise 

Final  Report 

Air  Force  Office  of  Scientific  Research 

Dr.  Fariba  Fahroo,  Program  Manager,  Dynamics  and  Controls 

Jason  L.  Speyer 

Mechanical  and  Aerospace  Engineering  Department 
University  of  California,  Los  Angeles 
Los  Angeles,  CA  90095-1597 

A  new  class  of  scalar  and  vector-state  estimators  and  stochastic  controllers  for  linear  dynamic 
systems  with  additive  Cauchy  process  and  measurement  noises  has  been  developed.  The  Kalman 
filter  and  the  linear-quadratic-Gaussian  controller  have  been  the  main  estimation  and  control 
paradigms  in  modern  engineering.  However,  many  practical  system  noises,  such  as  radar  glint,  are 
better  described  by  heavy  tailed  probability  density  functions  (pdf).  Although  the  Cauchy  pdf  has 
an  infinite  variance,  the  conditional  density  of  a  Cauchy  random  variable,  given  a  linear  measure¬ 
ment  with  an  additive  Cauchy  noise,  has  a  conditional  mean  and  a  finite  conditional  variance,  both 
being  functions  of  the  measurement.  Over  the  last  three  years,  a  theory  of  estimation  and  stochas¬ 
tic  control  has  been  developed  for  the  vector  state  linear  dynamic  system.  The  methodology  for 
scalar  state  systems  entailed  propagation  of  the  conditional  pdf,  while  the  vector  state  case  was 
addressed  by  developing  a  recursion  for  the  analytic  propagation  of  the  character  function  of  the 
unnormalized  conditional  pdf  (ucpdf).  Through  a  spectral  transformation,  the  character  function 
of  the  ucpdf  is  used  explicitly  in  the  development  of  stochastic  controllers  for  vector- state  systems. 


i 


Contents 


Abstract  i 

1  Introduction  1 

2  Current  Results  on  Cauchy  Filters  and  Controllers  3 

2.1  Formulation  and  Derivation  of  the  Cauchy  Estimator . 3 

2.1.1  Characteristic  Function  for  the  Un-normalized  Conditional  pdf . 4 

2.1.2  Conditional  Mean  and  Variance  at  k=\ . 5 

2.1.3  Propagation  to  k=  2  and  the  Second  Measurement  Update . 6 

2.1.4  General  form  of  <Pxk\zk(Y) . 6 

2. 1 .5  The  Mutlivariable  Estimator . 8 

2.1.6  Numerical  Examples  of  the  Vector-State  Cauchy  Estimator . 8 

2.2  The  Stochastic  Model  Predictor  Cauchy  Controller . 9 

2.2.1  Cost  Criterion  for  Controller  of  Cauchy  System . 11 

2.2.2  The  Conditional  Performance  Index  in  Teims  of  the  Spectral  Variables  ...  12 

2.2.3  Closed-Form  Cost  Criterion:  Scalar-  State  System . 12 

2.3  Numerical  Examples  of  the  Scalar  and  Two-State  Stochastic  Controller . 13 

2.3.1  One-Step  One-Measurement  Examples  for  Scalar  and  Two-state  Systems  .  .  13 

2.3.2  Multi-Step  Numerical  Example  for  the  Scalar  and  Two-State  System . 15 

3  Conclusions  17 

4  References  19 

A  Appendix:  Multivariate  Cauchy  Estimator  with  Scalar  Measurement  and  Pro¬ 
cess  Noises  22 

B  Appendix:  Multivariate  Cauchy  Estimator  with  Scalar  Measurement  and  Pro¬ 
cess  Noises  47 

C  Appendix:  Linear  Dynamic  Systems  with  Additive  Cauchy  Noises  Part  1: 
Stochastic  Estimation  for  Two-State  Systems  56 

D  Appendix:  A  Stochastic  Controller  for  a  Scalar  Linear  System  with  Additive 
Cauchy  Noise  63 

E  Appendix:  A  Stochastic  Controller  for  Vector  Linear  Systems  with  Additive 
Cauchy  Noises  78 

li 


F  Appendix:  Linear  Dynamic  Systems  with  Additive  Cauchy  Noises 
Stochastic  Model  Predictive  Control 


Part  2: 

91 


1  Introduction 


In  many  engineering,  economic,  telecommunications,  and  science  applications  the  underlying  ran¬ 
dom  processes  or  noises  have  significant  volatility,  which  are  not  captured  by  Gaussian  distributions 
[1].  Rather  than  light-tailed  Gaussian  distributions,  heavy-tailed  distributions  have  been  shown 
to  better  represent  these  volatile  random  fluctuations.  Examples  from  the  practical  engineering 
world  include  radar  and  sonar  sensor  noises  [2] ,  air  turbulent  environment  noise  [3] ,  and  adversarial 
motion.  Our  objective  is  to  develop  estimation  and  stochastic  control  techniques  for  linear  dy¬ 
namic  systems  with  heavy-tail  distributed  noises,  which  are  in  the  class  of  symmetric  alpha-stable 
(Sa-S)  distributions  [4].  In  their  simplified  form,  Sa-S  distributions  of  scalar  random  variables 
are  characterized  by  their  characteristic  function  expressed  as  cp(v)  =  e_crC,|'/|a’,  where  o  and  or 
are  positive  parameters  and  v  is  the  spectral  variable.  In  this  class,  cr  =  0-5,  1,  2  yield  the  Le/y, 
Cauchy  and  Gaussian  distributions,  respectively.  For  a  e  (0,  2),  all  the  densities  have  infinite 
variance. 

In  detection  of  a  radar  signal  in  clutter,  the  in-phase  component  of  radar  clutter  time  series 
agrees  extremely  well  with  a  Sa-S  probability  density  function  (pdf)  with  a  =  1.7  [5].  For 

a  e  [1,  2]  a  maximum  likelihood  Cauchy  detector,  which  is  in  the  class  of  myriad  filters  [6], 
exhibited  performance  that  is  very  close  to  the  Cramer-Rao  bound,  whereas  a  maximum  likelihood 

Gaussian  detector  deviated  significantly  as  a  varied  from  2  to  1.  Although  numerically  intensive, 
the  myriad  filters  or  detectors,  based  on  a  cost  criterion  derived  from  the  a-stable  pdfs,  show 
significant  improvement  in  detecting  a  signal  in  heavy-tailed  noise  over  Gaussian  detectors  [7]. 
Similar  performance  was  observed  when  processing  data  in  a  multi-user  communication  network 
[8].  The  shortcomings  of  the  Kalman  filter  and  the  linear-quadratic-Gaussian  (LQG)  controller 
when  processing  non-Gaussian  noise  data  in  the  context  of  aircraft  navigation  [9]  and  radar  glint 
[10]  suggest  that  new  filters  and  controllers  for  heavy-tailed  pdfs  are  required. 

The  robustness  of  Cauchy  detectors  discussed  above,  together  with  the  deficiencies  of  the 
standard  Gaussian  estimators  and  controllers  when  exposed  to  impulsive  noises,  motivated  the 
derivation  of  a  sequential  non-linear  estimator  for  scalar  linear  dynamic  systems  with  additive 
Cauchy  process  and  measurement  noises  [11, 12].  Although  Cauchy  noises  do  not  have  a  well 
defined  first  moment  and  have  infinite  second  and  higher  moments  [13],  the  conditional  pdf  (cpdf) 
of  the  system  state  given  the  measurement  history  was  determined  analytically  and  shown  to  have 
well  defined,  finite  conditional  first  and  second  order  moments  [11]. 

Over  the  last  three  years,  we  have  developed  the  theoretical  basis  for  constructing  minimum 
variance  estimators  and  stochastic  controllers  for  this  class  of  systems.  In  our  initial  results  for 
scalar  dynamic  systems,  the  conditional  pdf  of  the  scalar  state  given  the  measurement  sequence 
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could  be  generated  directly  [11].  Partial  fractions  were  used  to  update  the  conditional  pdf  at 
each  measurement  time.  However,  this  methodology  does  not  generalize  when  addressing  dynamic 
systems  with  a  vector- state.  In  [12]  a  scalar  estimator  was  derived  by  generating  the  characteristic 
function  of  the  unnormalized  cpdf  (ucpdl)  in  a  recursive  scheme.  Although  the  two  scalar  estimator 
formulations  are  similar  in  form,  the  characteristic  function  approach  is  somewhat  simpler  than 
the  scheme  in  [11],  It  allows  for  a  stronger  result  regarding  the  decay  of  the  estimator  parameters 
with  time.  Moreover,  it  was  shown  that  this  approach  can  be  generalized  to  the  multi-variable 
case  [14,  Appendix  A],  [15,  Appendix  B]  and  for  the  special  two-state  system  [16],  [17,  Appendix 
C]. 


As  shown  in  [14,  Appendix  A],  the  characteristic  function  of  the  ucpdf  is  composed  of  a  sum  of 
terms.  Each  term  has  the  form  of  a  coefficient,  which  is  a  function  of  the  sign  of  a  linear  function  of 
the  spectral  variables,  multiplied  by  an  exponential  whose  argument  is  a  sum  of  absolute  values  of 
the  same  linear  functions  of  the  spectral  vector.  This  analytic  form  persists  through  measurement 
updates  and  dynamic  propagation.  Since  the  characteristic  function  of  the  ucpdf  is  shown  to 
be  twice  continuously  differentiable  [14,  Appendix  A],  the  conditional  mean  and  the  conditional 
variance  are  determined  by  evaluating  the  characteristic  function  and  its  first  two  differentials  as 
the  magnitude  of  the  spectral  vector  goes  to  zero. 

Although  a  dynamic  programming  solution  to  the  stochastic  control  problem  does  not  currently 
appear  tractable,  a  stochastic  controller  has  been  obtained  for  the  model  predictive  or  open-loop 
feedback  formulation  in  the  scalar  case  [18]  and  [19,  Appendix  D],  and  extended  to  the  vector 
case  [20,  Appendix  E],  [21,  Appendix  F],  To  ensure  that  the  unconditional  expectation  of  the  cost 
function  exists  and  remains  finite  for  Cauchy  uncertainties,  and  that  the  conditional  expectation 
of  the  cost  function  can  be  determined  in  closed  form,  a  new  cost  function  is  proposed.  This 
cost  function  is  composed  of  products  of  penalty  functions  on  the  state  and  control  that  are  in 
the  form  of  the  Cauchy  distribution.  Although  the  conditional  expectation  of  the  cost  function 
can  be  determined  from  the  conditional  pdf  in  the  scalar  problem,  for  the  vector-state  control 
problem  Parseval’  s  Identity  is  used  to  transform  the  conditional  expectation  of  the  cost  function 
to  spectral  domain.  Then,  the  characteristic  function  of  the  ucpdf  is  used  directly  to  determined 
the  conditional  expectation  of  the  cost  function  in  closed  form  [20,  Appendix  E],  [21,  Appendix  F]. 
Results  on  the  performance  of  this  stochastic  Cauchy  controller  strongly  indicate  that  it  handles 
outliers  in  the  measurements  dramatically  well  compared  to  Gaussian  controllers.  We  present  our 
current  results  for  the  vector  discrete-time  Cauchy  filter  and  stochastic  controller  in  section  2. 
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2  Current  Results  on  Cauchy  Filters  and  Controllers 


Over  the  last  three  years  we  have  obtained  impressive  results  for  vector-state  Cauchy  estimation, 
which  are  detailed  in  [11, 12, 14  -  17,  Appendices  A,  B  &  C],  are  summarized  in  section  2.1  and  our 
novel  control  results,  which  are  detailed  m  [18  -  21,  Appedices  D,  E  &  F],  are  summarized  m  section 

2.2. 

2.1  Formulation  and  Derivation  of  the  Cauchy  Estimator 

The  single-input  single-output  multivariable  linear  dynamic  system1  is 


xk+  i  =  Oxk+rwk,  zk  =  Hxk  +  vk, 


(1) 


where  the  state  vector  xk  e  Rn,  scalar  measurement  z^,  and  known  matrices  0  6  Rnxn,  T  e 

Rnxl,  and  H  e  Rlxn.  The  noise  inputs  are  assumed  to  be  independent  with  know  Cauchy  pdf. 
Specifically,  wk  is  assumed  to  be  Cauchy  distributed  with  a  zero  median  and  a  scaling  parameter 
(3  >  0.  Similarly,  vk  has  a  Cauchy  pdf  with  a  median  of  zero  and  a  scaling  parameter  y  >  0.  The 
characteristic  functions  of  these  scalar  noises  are  assumed  to  be  time  independent  and  given  by 

cpw  v  =  e-/3|'/|,  cpv  v  =e~v (2) 


where  these  characteristic  functions  have  a  scalar  argument  v.  The  initial  conditions  at  k  =  1  are 
also  assumed  to  be  independent  and  Cauchy  distributed.  Specifically,  each  /- th  element  Xp  of  the 
initial  state  vector  X\  has  a  Cauchy  pdf  with  a  zero  median  and  a  scaling  parameter  or,  >0,  /  = 
l, . . . ,  n.  The  characteristic  function  of  the  joint  pdf  of  the  initial  conditions,  which  is  a  function 
of  a  /7-dimensional  spectral  variable  v  e  Rn,  is  given  by 


<Px,  v 


171 


/=  1 


I 


\ 


FI 


\ 


ctiVi  I  n  ■  •  n  11'  1 

r  1  =CXP  -  O/l  V/ 1  =  exp  -  P/|fa(,  v)\  +j(b{,  v)  ■  (3) 


/=  1 


/=  1 


The  last  form  was  introduced  for  notational  convenience  to  be  used  in  the  sequel.  We  used  the 
definitions 

p1  1  1 

/  =  cti,  a ,•  =  eh  i=\,...,n,  b{  =  {0}n,  (4) 

where  e,-  is  a  /7-dimensional  /-th  unity  vector  and  {0}n  is  /7-dimensional  vector  of  zeros. 

'The  single-input  single-output  restriction  only  simplifies  the  presentation  and  appears  to  be  easily  relaxed. 
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2.1.1  Characteristic  Function  for  the  Un-normalized  Conditional  pdf 


Our  goal  is  to  compute  the  minimum  variance  estimate  of  X/<  given  the  measurement  measurement 
history  or  Zk  =  {zhz2,  ■  ■  ■  ,zj  [14,  Appendix  A],  We  begin  by  determining  the  characteristic 
function  for  the  un-normalized  conditional  pdf  at  k  =  1 ,  where  the  conditional  pdf  at  k  =  1  is 


fx,|z,  X!  |  Z | 


fX]Z]  XhZi  fZi\Xi  Z 1  |  Xi  fXl  X!  fv  Zi  —HXi  fXl  X| 
fzi  Zi  ^Zi  Zi  fzt  Zi 


(5) 


The  unnormalized  conditional  pdf  (ucpdf)  at  k  =  1  is  simply  the  joint  pdf  of  the  vector  state 
and  scalar  measurement  and  we  will  work  explicitly  with  the  form  fv(zx  —  Hxy)fXx  (x,).  The 
characteristic  function  of  the  ucpdf  is  obtained  as 


cp  <y)  = 

XiIZi 


po 


— oo 


po 

f  (x  )f  (z  —  Hx  je'ht  vdx  = 

Xi  1  V  1  1  J 

—  OO 


1  P° 
(2tT)n  m 


po 

(p  ( v-n)(p  ( n)dn-  (6) 

X,  V 

— oo 


The  first  integral  is  a  Fourier  transform  of  a  product  of  two  functions.  Using  the  dual  convolution 
property,  the  second  integral  is  a  convolution  in  the  v  domain  between  the  associated  characteristic 

functions  <Pxx(y)  given  in  (3)  and  the  characteristic  function  of  /z,  \x{  (Z\  |  xx)  =  fv(Z\  —Hxx), 

determined  in  [14,  Appendix  A]  as 


po  po 


eTv 


cp/(v)-  ...  fv(zx  -  Hxi)eixivdxi  =  K(v)cpv  ~tjx~  2n5(ejPnv), 

ri6n  ;  -i 


— oo  — oo 


i=l 

( .  r\  . 

Ter  Jz  i©nV  -  <  ( 


fi/neve  Hen/=  0 ,Pn  =  I  -  n ,  K(v)  =  exp  n  ,  and  (pv  -  e"v  =exp  ~Y  e 


(7) 

\ 


Hen  Hen  Hen  Hen 

Substitution  of  the  n  —  1  delta  functions  of  (7)  into  (6)  reduces  the  n  integrals  to  one  as 


1  Xi|Z, 


<V)  = 


jGO  jGO 

2^  <PxSv  ~  ( _o)eiz'ado=  _L  ^(v  -  HTo)cpv  {-o)ejz'ado, 

— oo  — oo 


(8) 


where  (pv{~d)  =  e  K|a|. 

The  convolution  integral  in  (8)  is  solved  in  closed  form.  Here  we  assume  that  each  element  in 
H  is  non-zero,  i.  e.,  He,  /=  0,  /  =  {\,  ■  ■  ■ ,  n}.  (See  [14,  Appendix  A]  regarding  relaxing 

this  condition.)  To  compute  the  integral,  we  define  P/=  of/|/-/e,j,  p,  =  eTv/Heh  i  =  { l, 

n}. 
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Pn+l  =  Y, 


Po  =  ~  00  and  pn+ 1  =  00 .  Hence,  the  convolution  integral  is  restated  and  solved  as 


_  po 

cp  (v)=  1 


X1IZ1 


fci  pi\!Ji-o\+jz\odO  = 


1 

27T 


n+1 


R=  1 


g"1(yllV))erV» 
i  gi 


(9) 


where 


gill  Hi  Hi  _1  Hi  _1 

/  (yg/  ('/))  =  (Jz  1  +  Pr  +  yg/  (\/))  -  (y’Zi  -  pR  +  ygi  ( 1/ ))-  , 


/7+1 

n+1 

Pi  sign (p/  -  Pr), 

ye!  V)  =  - 

p/|p,  -  pr|  +yz,pR. 

(10) 

(11) 


As  shown  in  this  first  update,  the  coefficient  (10)  is  always  a  function  of  sign(-)  given  by  ybl(v) 
in  (11)  where  the  argument  of  the  sign(-)  is  always  found  in  the  argument  of  the  absolute  value 
given  by  ylj\v)  in  (11).  This  is  a  general  property  the  recursion  for  the  characteristic  function  of 
the  ucpdf  for  any  update  stage  time  k. 


2.1.2  Conditional  Mean  and  Variance  at  k=  1 


The  conditional  mean  and  variance  are  computed_by  evaluating  cpx l|Z|('/)  and  its  first  two  deriva¬ 
tives  at  v  =  {0}n  or  alternatively  as  v  -*>  {0}n.  <  x,|z,  can  be  shown  t0  be  twice  continuously 
differentiable  [14, 15,  Appendices  A  &  B].  Thus,  its  first  two  derivatives  can  be  evaluated  along  a 
fixed  direction  v  =  Ev  while  letting  E  — >  0.  The  pdf  of  the  measurement  variable  is  obtained  as 


/rz1(Zi)=^i|Zi(E^o 


1 


R=  1 

■f 


aR\hR\+Y 


-V 


n  y, 

z\  +  ctr  I  /?r  |  +  y 

R=1 


(12) 


The  minimum  conditional-variance  estimate  is  given  by 

_ 1 


Xi(Z!)  = 


y^(zi) 


(  _  \T 

v5(p  (E\p  T 

X1IZ1 


Ev 


=  z  1 


E=  0 


[cr,  sign(hi)  ■  ■  ■  an  sign(/7„)]T 

- f - 

aR\hj\  +y 

R=  1 


(13) 
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The  conditional  variance  is  (X)  =  X\  —  Xi) 


E[xixj'|zl]  = 


f 

1  + 


3/7 

M 


or/I  /J,-|  +yf 


□ 

□ 

E 

0 


\ 


[%■  OT/lh/l+K 


/=2 


□ 


-cr]Crn  signify)  sign(fy) 


: 


. . .  -axOn  sign(/?i)sign(/7„)D 


On 

\hn\ 


I 

n  —  1 


CT/I/7/I  +  y 


tel 


(14) 


Note  that  with  one  measurement,  the  n  x  n  conditional  variance  is  bounded  and  positive  definite. 
Furthermore,  the  conditional  variance  is  an  explicit  function  of  the  measurement. 


2.1.3  Propagation  to  k=  2  and  the  Second  Measurement  Update 


The  time  propagated  characteristic  function  to  k  =  2  is  found  in  [14, 15,  Appendices  A  &  B]  as 


x2|z,('/)  =  <  XilZ,^0  ^ 


r,A*-flr7V| 


(15) 


The  convolution  integral  for  the  second  measurement  update  is 


—  CO 

where  zi  =  {zh  z{}.  Since  <  X|  iZl  (v)  is  twice  differentiable,  then  for  the  linear  transformation 

1  Xi|Zi(Ot\/)  is  also  continuous.  By  assuming  HT  /=  0,  it  can  be  shown  that  the  first  two 
derivatives  of  Z2(v)  are  continuous.  If  HT  =0,  then  e_/3|r‘  comes  out  of  the  convolution 
integral.  Consequently,  <Px,\zSv)  d°es  n°t  have  a  continuous  derivative  because  the  derivative  of 
e“^l  rrd  is  only  piecewise  continuous  and  thus  there  is  no  estimate  of  x  2 


2.1.4  General  form  of  <pxk \zk(v) 

In  general,  the  convolution  integral  for  the  kth  measurement  update  is 


‘  xk\zk 


(v)  = 


2tt 


—  CO 


(16) 
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where  the  measurement  history  is  z k  =  {z\,  ■  ■  ■  ,Zk}.  It  is  assumed  that  HT  /=  0  at  each  time 
step  and  that  by  induction  <  xk^\Yk_  (v)  is  twice  differentiable.  Therefore,  i  Xk\zk(v)  is  twice 
continuously  differentiable.  It  is  expressed  in  a  closed  form  [14, 15,  Appendices  A  &  B]  as 


where 


ft\ k  v 
yxk\zk 


_nt  gk\k  k\k 

/  gi  M  exp  yei  (v)  , 

/'=  1 


(17) 


k  ei  k\k  k\k  k 

ygi  (v)  =  QiR  Sign(fa/R  ,i/J)eR, 

R=  1 


k\k 

Yei  (V)  =  ~ 


k\k 


k\k 


k\k 


PiR  \(aiR 

R=  1 


v)\+j  fy'V). 


(18) 


In  the  above,  qk^k  e  R^,  pk^k,  ak^k  e  Rn,  and  bk^k  e  Rn  are  all  parameters  computed  recursively  up 

iR  iR  iR  i 

to  time  k.  This  functional  form  persists  at  every  time  step. 


Note  that  this  form  (17)  is  also  consistent  with  the  characteristic  function  of  the  ucpdf  for  the 
scalar  system  [12] 


k+2 

cp  (v)=  ~  a(k\k)+jb(k\k)sign(v)  e~^kmv^a^k)v .  (19) 

Xk\zk  i  i 

/=  1 

The  parameters  a,(/c|/c)  and  b,(k\k)  satisfy  a  linear  discrete  dynamic  equation,  from  which  it  is 
shown  that  they  converge  to  zero  as  k  —  i  becomes  large.  Consequently,  these  decaying  terms  can 
be  removed  from  the  sum  in  (19).  This  property  should  generalize  for  the  g  .  y  .(v)  in  (17), 

k  I  k 

in  which  one  of  the  central  goals  of  the  proposed  project  is  to  prune  the  sum  in  (17)  such  that  n  't 
remains  bounded. 


There  are  some  simplifications  that  occur  in  the  two-state  estimation  problem  [16],  [17,  Ap¬ 
pendix  C].  A  recursion  in  the  terms  in  the  argument  of  the  exponential  can  be  made  explicit.  A 


basis  Bk  for  each  term  in  the  sum  is  recursive  as 


□  □ 

Bk-  i®T 

b„  =  °  rT  I 

U  j 

HA 


B  i 


=  fSi  .A  = 


[  v  [ 

HA 


f 

0 

-1 


I 

1  . 

0 


(20) 


This  basis  is  related  to  ak^k  e  Rn  in  the  general  solution  (18)  in  that  every  ak^k  vector  is  coaligned 

iR  iR 

with  a  row  in  Bk.  This  bases  appears  minimal  and  thereby  all  terms  having  the  same  argument 
of  the  exponential  can  be  combined.  Generalization  of  this  result  for  the  multivariate  case  will  be 
considered  also  in  the  new  project. 
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2.1.5  The  Mutlivariable  Estimator 


As  shown  in  [14, 15,  Appendices  A  &  B],  to  construct  the  conditional  mean  and  variance,  choose 

k\k 

v  =  Ev  where  E  >  0  and  v  is  a  fixed  direction.  Assuming  the  condition  that  (ajR  ,v)  0  V  (/,  £), 

then 


k\k  k\k  i 

sign ((aiR  ,  Ev))  =  sign ((aiR  ,  v))  =  sR, 


k\k  el  k\k  i  k\k  k\k  k\k 

ygi  (Ev)  =  qiR  sR  =  ygi  (0),  yei  (EV)  =  E(yei  (0),  v), 

R=  1 


(21) 

(22) 


where  s'  is  a  piecewise  constant  and  (^k(V),  v)  is  a  constant.  The  pdf  of  the  measurement  history 

R  ei 

is  determined  as 

k  |  k 

fzk  Z/c  =  (P  EV  Qk\k  vk\k(V)  .  (23) 

xk\zk  E=0=  y/  ygi 
/=  1 

The  conditional  mean  of  the  state  X/c  is 


1 


Xk  -  E[Xk\zk]  -  ■  f 

J  'Z k  Zk 

and  the  second  moment 


i  -  *  / 

'd(-xk\zk  Ev 

d(EV) 


tn‘ 


k\k 


E= 0 


jfzk  zk  /=; 


k\k  k\k  k\k 

g,  \  (!)  ye  ("V  (24) 


E[Xi,xt  lzk]  = 


1 


n2 
o  ( 


X/,|Zk 


£9 


„k\k 


-1 


9 


/<\k 


/'kCv, 


hzk  zi  S(Ev)d(Ev)T  fZ;  z„  fcl  ' 


U' 


■  (25) 


For  the  error  \  -  Xk  ~  h,  the  conditional  error  variance  can  be  evaluated  as  E[XkXT jzk] 
E[xkxT  \zk]  -  xkxT . 


2.1.6  Numerical  Examples  of  the  Vector-State  Cauchy  Estimator 

Estimation  results  for  a  two-state  system  with  Cauchy  and  Gaussian  noises,  related  by  a  least 
square  fit,  are  shown  in  Fig.  1.  For  the  two-state  system,  data  can  be  streamed  continuously  by 
using  a  finite  data  window,  as  given  in  [17,  Appendix  C].  The  error  is  shown  to  be  in  the  sixth 
place,  thereby  insignificant.  In  the  subfigure  1(a)  we  see  that  the  Gaussian  estimator  error  deviates 
from  the  minimal  Cauchy  conditional  mean  error  in  the  Cauchy  simulation.  Also  we  note  that  the 
actual  conditional  standard  deviation,  generated  by  the  Cauchy  estimator,  fluctuates  dramatically 
with  the  Cauchy  noises.  For  the  Gaussian  simulation  of  Fig.  1(b)  both  the  Cauchy  and  the 
Gaussian  filters  perform  almost  identically,  although  the  conditional  standard  deviation  generated 


by  the  Cauchy  estimator  slightly  upper  bounds  the  actual  conditional  standard  deviation,  a  priori 
computed  for  the  Gaussian  estimator.  Numerical  results  for  the  three-state  Cauchy  estimator  can 
be  found  in  [14, 15,  Appendices  A  &  B], 


0  20  40  60  80  100  0  20  40  60  80  100 


Time  Steps  ( k )  Time  Steps  ( k ) 

(a)  Cauchy  Noise  Simulation  (b)  Gaussian  N oise  Simulation 

Figure  1:  Cauchy  and  Kalman  Estimators  for  a  stable  system.  Simulation  parameters  are:  ofi  = 
a2  =  0.8,  Y  =  0.5,  /3  =  0-1,  H  =  [1  1],  T  =  [0.5,  l]r,  and  the  eigenvalues  of  0  are  0.8  ±  .55 j. 
Heavy  lines  are  the  state  estimate,  light  lines  depict  the  predicted  standard  deviation. 


2.2  The  Stochastic  Model  Predictor  Cauchy  Controller 

A  stochastic  optimal  control  problem  is  formulated  where  the  conditional  performance  index, 
which  is  the  conditional  expectation  of  the  cost  function,  is  taken  with  respect  to  the  Cauchy 
conditional  pdf.  However,  since  only  the  characteristic  function  of  the  conditional  pdf  is  available, 
a  transformation  to  the  spectral  vector  is  required  which  then  leads  to  a  closed-form  expression  for 
the  conditional  expectation  of  the  cost  function.  In  order  for  the  conditional  performance  index 
to  be  analytic,  a  new  cost  function  is  chosen. 
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The  dynamic  stochastic  system 


xk+i  =  Q)xk+Auk+  T  wk,  zk  =  H xk  +  vk  (26) 

with  measurement  history 

z  k  =  {z\, ... ,zk. }  (27) 

is  the  same  as  (1)  except  for  the  addition  of  the  scalar  control  uk  and  known  matrix  A  e  Rn. 
To  determine  a  form  of  the  conditional  performance  index,  where  the  control  enters  in  a  more 
convenient  manner,  the  state  vector  is  decomposed  into  a  dynamic  system  forced  by  the  control 
and  a  dynamic  system  forced  by  the  underlying  random  variables.  Given  this  decomposition,  we 
show  that  the  control  has  to  be  adaptive  to  only  the  o-  algebra  generated  by  the  measurement 
history  associated  with  the  stochastic  part  of  the  state  decomposition. 

Consider  the  linear,  discrete-time,  scalar  stochastic  system  of  (26)  with  the  measurement  his¬ 
tory  given  by  (27).  Let  uk  be  adaptive  to  the  filtration  cr-algebra  ok  generated  by  the  measurement 
history  zk.  Filtration  implies  that  the  collection  of  cr-algebras  <Jk  have  the  property  that  if  j  <  k , 

then  <Jj  <2  Ok  [22],  Therefore,  filtration  is  the  evolution  of  the  cr-algebra  generated  by  measurement 
history  through  time.  Adaptation  means  that  the  control  is  a  measurable  function  of  events  on  this 
cr-algebra,  i.e.,  this  ensures  that  the  control  sequence  is  causal.  Now  consider  the  decomposition 
Xk  =  Xk  +  Xk  where 


xk=  OXk-i  +  wk-i,  Zk  =  Hxk  +  vk,  (28a) 

Xk=  OXk-i  +  Uk^,  a  =  Hxk.  (28b) 

Here,  xk  and  A  are  the  state  and  the  measurement  of  the  subsystem  containing  all  the  underlying 
random  variables,  i.  e.,  wk,  vk,  and  the  initial  condition  *o,  which  is  Cauchy  distributed  with  zero 
median.  Similarly,  X<  and  A  are  the  state  and  measurement  of  a  dynamic  system  driven  by  uk 
with  initial  condition  A- 

The  measurement  history  can  be  decomposed  as  zk  =  Zk  +  A  where 

A  =  {'A,  ■  ■  ■  ,  A},  A=  {A,  ■■■,%}■  (29) 

In  the  following  it  is  shown  that  the  control  is  measurable  on  events  generated  by  Zk  only. 

Theorem  1.  Consider  the  filtration  o  —  algebra  dk  generated  by  a,  with  the  decomposition  zk  = 
Zk  +  Zk-  For  A  e  dk  and  dk-\  c  dk,  a  is  adapted  to  dk- 1  and  uk  is  adapted  to  ck. 

Proof.  Start  with  k  =  0.  The  initial  state  is  decomposed  as  x0  =  Xq  +  X),  where  X)  is  a  given 
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non-random  parameter.  The  measurement  decomposes  as  z0  =  %  +  %,  where  X)  =  FIX)  is  a  given 
non-random  parameter  and  X)  =  X)  e  Qj.  Then,  u0,  which  is  determined  by  z0,  is  adapted  to  qj. 
At  k  =  1,  both  3<j  =  Oi()  +  u0  and  X  =  Fix \  are  adapted  to  qj,  and  thus  \  is  adapted  to  qj. 
For  the  measurement  at  k  =  1,  \  e  q,  \  e  q,  and  qj  c  q.  Hence,  since  U\  is  determined  by 
zi  =  Z]  +  z, ,  it  is  adapted  to  q.  Recursively  to  any  k,  2k  is  adapted  to  q~i-  With  zk  e  q,  and 
q_i  c  dk,  Uk  that  is  determined  by  Zk  =  zk  +  zk  is  adapted  to  q.  □ 


2.2.1  Cost  Criterion  for  Controller  of  Cauchy  System 


Since  dynamic  programming  does  not  have  a  tractable  solution  [18, 19,  Appendix  D],  a  multi- 
step  model  predictor  Cauchy  controller  is  proposed.  The  maximization  for  the  unconditional 
expectation  of  the  cost  function  is 


J* 


max  E  ip  xR+i  (jp- 1 

UP-'eF  ’  f 

=  E  max  E  ip  Xp  ,  Up~l  z 

iip-i^c  k+l  k  ^ 


f  I 

=  E  max  J±  =  E  J* 

Up~leF  k  * 


(30) 


where  pi  X^+l,Up  1  is  given  in  (31),  the  unconditional  expectation  is  assumed  to  exist  and  the 
expectation  is  nested  where  the  outer  expectation  is  over  the  measurement  history  and  the  inner 
expectation  is  over  all  other  random  variables.  The  operations  of  maximization  and  expectation 
have  been  interchanged  [23]  and  the  conditioning  on  zk  is  justified  by  Theorem  1.  The  arguments 

of  the  cost  function  are  the  projected  state  and  control,  given  as  XPk+\  :=  {Xk+i, . . . ,  xp }  and 
p- 1  p- i 

Uk  :=  {Uk, . . . ,  Up- Uk  e  F,  where  F  is  the  class  of  piecewise  continuous  functions  adapted 
tO  Zk. 


The  cost  function  for  the  Cauchy  dynamic  system  has  penalty  functions  of  the  Cauchy  pdf 
form  as 


I 


\ 


/tt 


,  m  n  r/TT  n  n 

V  Xlvur'  =  £u,(Ui)  ■  £x,.+1  (X/+1)  =  2  2 

i=k  i=k  Uj  +  £/  r=  i  Xi  Hi  i  I , r 


(31) 


To  compare  the  Cauchy  controller  to  a  Gaussian  controller,  an  exponential  cost  function  is  sug¬ 
gested  as  f  I 

p-l  12  1  2 

ip  XR+i,  fjp- 1  _  n  e  ~2nU,  .  g  —  2Cli+l,rXi+l,r  ’ 

i=k  r=  1 

A  solution  to  this  LEG  problem  can  be  found  essentially  in  [23],  but  it  is  explicitly  given  for  the 
scalar  problem  in  [19,  Appendix  D], 


11 


2.2.2  The  Conditional  Performance  Index  in  Terms  of  the  Spectral  Variables 


Since  only  the  characteristic  function  of  the  ucpdf  is  available  (See  section  2.1.5)  and  thereby  the 
characteristic  function  of  the  conditional  pdf,  the  conditional  performance  index2  can  be  rewritten 
by  applying  Parsevaf  s  identity  directly  [20, 21,  Appendex  E  &  F]  as 


lp-i  \  rx  rx 

ri  Q/n 


u 


,2  2 


oo  —CO 


i=k  i  +  C/ 

1  P~ 1  7  /  ^  t°° 

1  n  <7 tt  T 


£xp(Xp  +  Xp)  '  fxp\Zi<(Xp  \zkk)  dXpA  ■  ■  ■  dXp,n 


*Xp 


(2n)n  u2  +  C2 

I  i=k  i  ' 

Y  p~1  <7 n  ^  t130  r00*  n 


(y)  ■  <Pxp\^v)  dv  1 . . .  dvn 


— oo  — oo 


\ 


_1p,r  M  +f><P,rVr  (p~  ~(y)  dv}  .  .  .  dvn,  (33) 


i=k 


— oo  — oo 


r=V 


where  L*  refers  to  the  complex  conjugate  of  L.  Given  the  projected  form  of  the  characteristic 
function  of  the  ucpdf  (17)  and  that  =  £®<%;|z„(o,  Jz  can  be  evaluated  in  closed 

form.  In  the  next  section,  the  closed  form  of  the  conditional  performance  index  is  determined 
explicitly  for  the  scalar  system.  For  the  two-state  system,  the  conditional  performance  index  is 
found  in  closed  form  in  [20, 2 1 ,  Appendex  E  &  F] . 


2.2.3  Closed-Form  Cost  Criterion:  Scalar  State  System 


The  scalar  state  example  considers  only  a  weight  on  the  terminal  horizon  state  xp,  a  weight  on 


Uj  V/  e  {k, ...  p  -  l},  and  the  terminal  horizon  is  m  =  p  -  k.  The  cost  criterion  is 

1  lp_1  C /tt  X  r 

.  =  max  _  i*2Vt)  /  u2  2  e-nPW\+jXpV  cpXplz< (Y)  dv, 

‘  Uk  eF  ~  i-L-  i  +  S/ 


(34) 


i=k 


where  ( 1 9)  is  used  to  obtain  the  projected  form  of  the  characteristic  function  of  the  ucpdf  (p  (v). 

Xp|Zk 

The  cost  criterion,  found  in  closed  form  as 


k+2 


,  F*  1  ■  jj  1  a/(jfe  +  CT/)A7P  +  (bj-  a&Xno  +  uJi) 

Jrr  =  7F -  If  <U  ( X+Oy+ljf 


i=k  i  +  C/ 


P  i=l  ‘ 


P  ' 


,+  W.)2 


(35) 


is 


to  be  maximized  with  respect  to  Up  e  F  and  subject  to  the  deterministic  state  as  propagated 

)  > 


as  Xp=Qmxk+  Om  1  Auk+i-\. 

i=  1 


2For  simplicity  we  consider  only  the  terminal  state  in  cost  function. 
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2.3  Numerical  Examples  of  the  Scalar  and  Two-State  Stochastic  Con¬ 
troller 

To  obtain  insight  into  the  properties  of  the  Cauchy  stochastic  controller,  the  one-step  one-measurement 
example  is  first  analyzed  in  section  2.3.1.  Next,  the  multi-step  numerical  simulation  results  are 
given  in  section  2.3.2,  which  illustrate  the  Cauchy  controllers  behavior  in  the  presence  of  dominate 
Cauchy  measurement  noise  and  then  dominant  Cauchy  process  noise. 


2.3.1  One-Step  One-Measurement  Examples  for  Scalar  and  Two-state  Systems 


The  value  of  the  optimal  control  signal  at  k  =  0,  i.e.,  tv*,  as  a  function  of  the  first  measurement 
4),  is  determined.  Specifically,  we  examine  the  value  of  the  optimal  control  signal  at  k  =  0,  i.e., 
p(z0),  as  a  function  of  the  first  measurement  i§,  that  varies  due  to  the  measurement  noise  v0  [19, 
Appendix  D],  while  considering  the  one  step  horizon,  i.  e.,  m  =  1.  The  parameters  for  the  system 
and  Cauchy  signals  are  first  chosen  as 


0  =  1,  H=l,  cr  =0.1,  13  =  0.02,  y  =  0.5,  >&  =  0. 


(36) 


Initially,  no  penalty  is  introduced  on  the  control  signal,  i.e.,  the  term  ^/(tv2  +  C2)  is  removed  from 

ooo 

the  objective  function  in  (35),  i.e.  <fo  -+  00  while  the  state  at  k  =  1  is  weighted  with  rji  =  .7. 
Therefore,  we  first  examine  the  case  of  one  step  horizon,  i.e.,  m  =  1 .  Substituting  these  parameters 
into  (35),  the  performance  index  becomes 


0. 1148(4. 1667z2  —  1.0163u0z0  +  l)  +  0.03416(7.5820^  + 3.4153u0z0  -  1) 
(z2  +  0. 16  )(tv2  +  0.6724)  (z2  +  0.16)  ((i/0  +  z0)2  +  1 .22) 


(37) 


The  optimal  controller  can  be  obtained  by  minimizing  (37)  with  respect  to  Uq.  The  necessary 
optimality  condition,  dJ  /qUq  _  q,  reduces  to  finding  the  roots  of  the  fifth-order  polynomial 

Zo 


l5U5  +  /4tv4  +  l3U  +  liU1  +  /[  tv  +  /(,  =  0, 


(38) 


where 

/5  =  1,  /4  =  3.5 z(),  /3  =  (5.2315z2  +  3.6681)  /2=(3.6806z3+6.6305z0); 

o  o 

/1=(0.9491z4  +  3.2124z2  +  2.9623),  l0  =  (0.07782z2  +  0.3992)z().  (39) 

0  0  0 

This  polynomial  always  has  at  least  one  real  root.  If  three  roots  are  real,  then  there  are  two  local 
maximum  values  and  the  larger  of  the  two  gives  the  optimal  control. 
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Solving  the  polynomial  numerically,  the  optimal  control  signal  is  plotted  versus  the  measure¬ 
ment  if)  in  Fig.  2(a)  for  Co  =  00  as  well  with  weighting  Co  =  L  2,  3  and  y  >  a.  Also  included  is 

the  LEG  controller  where  u*  =  —x,  i.e.,  it  is  linear  in  z  (see  [19,  Appendix  B]  for  details  of  the 
LEG  model  predictive  controller).  The  Cauchy  controller  in  Fig.  2(a)  for  large  \  goes  toward  zero. 
This  is  in  sharp  contrast  with  the  LEG  controller,  which  remains  linear  in  the  measurement.  This 
is  a  significant  difference  in  behavior  between  the  Cauchy  and  Gaussian  optimal  controllers  that 
can  be  deduced  analytically  from  (38).  If  u*(%)  is  finite,  the  dominant  term  in  (38)  as  1^1  -»  °° 

is  /ii/*(°°),  or  limi^i^oo  u*(2q)  —>  0.  Therefore,  the  problem  of  handling  outliers,  which  occur 
for  the  Cauchy  pdf,  appears  to  be  resolved  by  the  Cauchy  controller  explicitly,  and  not  in  some 
filter  as  has  been  done  traditionally.  Note  that  the  controller  design  process  explicitly  uses  the 
parameters  /  >  cr,  i.  e.,  it  should  expect  more  impulsive  measurement  uncertainty  than  process 
uncertainty.  If  Y  <  a,  then  the  Cauchy  controller  behaves  approximately  like  the  LEG  linear  con¬ 
troller  in  Fig.  2(b),  i.  e.,  it  should  expect  more  impulsive  process  uncertainty  than  measurement 

uncertainty.  The  effect  of  reducing  <fo  from  00  to  5  has  no  effect  on  either  the  Cauchy  or  LEG 
controllers,  although  further  reductions  do  have  a  small  effect  as  seen  in  Fig.  2(a)  where  y  >  a 
and  approximately  linear  Cauchy  controller  for  y  <  a  as  shown  in  Fig.  2(b).  However,  for  y  <  a 
a  homotopy  optimization  method  [19,  Appendix  B]  keeps  the  optimum  value  on  the  ridge  that 
emphasizes  the  terminal  state,  rather  than  emphasizing  small  control  values.  This  occurs  even  if 
eventually  the  control  results  from  a  local  optimum  and  not  the  global  optimum. 


Figure  2:  Scalar  Cauchy  and  Gaussian  one-step  controller  with  parameters  variations  in  C  for 
y  >  a  left  and  y  <  a  right. 


For  a  two-step  prediction  of  a  two-state  stochastic  controller  in  [20,21,  Appendices  E  &  F], 
all  the  simulations  use  the  same  system  dynamics  with  H  =  [1  1],  F  T  =  [0.5  1],  AT  =  [0.5  1], 

m  =  2,  and  the  eigenvalues  of  ®  are  0.8  ±  0.55 j.  The  terminal  state  weightings  are  H  -  U  1], 
and  when  control  weightings  are  used  they  are  C  =  10.  The  initial  condition'  s  scaling  parameters 

are  given  by  a  =  [0.8  0-8],  and  the  process  and  measurement  noise  parameters  /3  and  y  are  either 
0.5  or  0.1.  The  first  set  of  examples  are  shown  in  Fig.  3.  These  figures  show  the  applied  optimal 
control  input  at  the  first  time  step  given  the  first  measurement.  In  the  two  cases  presented,  all  the 
systems  parameters  are  the  same,  except  in  Fig.  3(a)  y  >  (3  (i.e.,  more  measurement  than  state 
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uncertainty),  and  in  Fig.  3(b)  (3  >  y  (i.e.,  more  state  than  measurement  uncertainty). 


The  example  in  Fig.  3(a)  shows  that  the  Cauchy  controller  is  nearly  linear  for  small  measure¬ 
ments  and  reduces  its  control  effort  to  zero  as  the  measurement  deviations  become  large.  This  is 
in  contrast  to  the  LEG  controller,  which  is  linear  and  thus  responds  strongly  to  large  measurement 
deviations.  This  behavior  in  the  Cauchy  controller  occurs  when  the  measurement  uncertainty  is 
larger  than  the  state  uncertainty.  In  the  opposite  case  shown  in  Fig.  3(b),  the  measurement  has 
less  uncertainty  than  the  state.  Here,  the  Cauchy  controller’  s  response  closely  matches  that  of  the 
LEG  in  a  neighborhood  of  the  origin,  and  in  fact  responds  even  more  strongly  than  the  LEG  for 
large  measurement  deviations. 

The  three  different  curves  in  both  of  these  figures  depict  the  control  signals  for  three  different 
control  weights:  no  control  weight,  =  10,  and  =  5.  As  expected,  heavier  control  weights  (i.e. 
smaller  0  reduce  the  control  effort.  Even  without  any  control  weighting,  the  response  in  Fig.  3(a) 
goes  to  zero  for  large  measurement  deviations.  The  fact  that  this  behavior  is  seen  when  there 
is  no  control  weighting  implies  that  the  attenuation  of  the  control  signal  for  large  measurement 
deviations  is  due  to  the  cpdf  and  not  the  objective  function.  Moreover,  this  behavior  is  not  shared 
by  the  LEG  controller  that  uses  a  similar  objective  function  but  assumes  light-tailed,  Gaussian 
distributions. 


2.3.2  Multi-Step  Numerical  Example  for  the  Scalar  and  Two-State  System 

The  dynamic  characteristics  of  the  Cauchy  optimal  controller  of  the  scalar  dynamic  system,  ob¬ 
tained  by  maximizing  the  performance  index  in  (30),  are  explored  through  several  multi-step 
numerical  examples.  The  Cauchy  optimal  predictor  control  results  are  compared  against  the  least- 
squares  equivalent  LEG  predictor  controller  and  the  Kalman  filter  [19,  Appendix  D].  The  example 
that  is  discussed  in  this  section  is  a  stable  system  with  ®  =  0-95,  H  =  1,  and  a  horizon  length  of 
m  =  2.  The  state  weight  parameter  is  chosen  as  r/p  =  Hk+2  =  0.7,  while  the  control  weights  are 
chosen  as  <T/  =  8,  i  =  k,k+l.  The  noise  parameter  values  (3  and  y  are  interchanged  to  see  how 
the  controller  performance  changes  when  it  is  designed  for  a  large  measurement  noise  impulse  in 
contrast  to  when  it  is  designed  for  a  large  process  noise  impulse. 

The  simulations  results  are  depicted  in  Figs.  4  where  the  system  parameters  are  given.  First, 
for  /  =  0.1  and  (3  =  .02,  depicted  in  Fig.  4(a)  when  the  noises  are  small,  the  Cauchy  and  the  LEG 
controllers  exhibit  similar  performance.  However,  they  behave  rather  differently  when  a  large 
measurement  pulse  occurs.  A  measurement  noise  pulse  does  not  represent  a  state  deviation  and 
thus,  for  proper  regulation,  the  controller  should  ignore  that  measurement.  The  Cauchy  predictive 
controller,  designed  for  y  >  f3,  is  able  to  make  this  distinction,  whereas  the  LEG  predictive 
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Figure  3:  Optimal  control  vs  the  measurement  for  the  first  time  step. 

controller  reacts  linearly  to  all  the  pulses  and  does  not  differentiate  as  shown  in  Fig.  4(a).  At  time 
steps  k  =  2  and  k  =  13  process  noise  pulses  occur,  and  although  both  controllers  react  to  them 
and  are  able  to  overcome  this  deviation,  the  Cauchy  controller  does  so  much  quicker  than  the  LEG 
by  applying  a  much  larger  control  effort.  The  Cauchy  applies  a  larger  control  because  its  gain  for 
small  measurement  values  are  higher  than  that  of  the  LEG.  Conversely,  when  a  large  measurement 
pulse  occurs  at  k  =  51,  the  Cauchy  controller  ignores  it,  applying  almost  zero  control,  whereas  the 
LEG  controller  applies  a  very  large  control  input  that  causes  the  state  to  deviate  away  from  zero, 
which  then  required  additional  control  effort  to  correct.  This  way  the  Cauchy  controller  manages 
to  avoid  unnecessary  actuation  and  thus  maintains  the  system  performance.  When  Y  <  P,  the 
behavior  of  the  Cauchy  and  LEG  controller  is  similar,  as  shown  in  Fig.  4(b).  This  demonstrates 
the  same  linear  behavior  as  was  seen  in  Fig.  2(b). 

In  [20, 21,  Appendices  E  &  F],  the  two-state,  multi-step  example  uses  the  same  dynamic  system 
as  the  two-state,  two-step,  single  measurement  example  where  m  =  2  and  &  =  £k+ 1.  In  Fig.  5(a), 
there  is  more  uncertainty  in  the  state  measurement  noise  than  in  the  process  noise.  When  large 
measurement  deviations  occur  (such  as  at  k  =  52),  the  Cauchy  controller’  s  effort  is  very  small  even 
though  there  is  no  weighting  on  the  control  inputs.  In  contrast,  the  LEG  controller  responds  with  a 
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Figure  4:  2-step  Cauchy  and  Gaussian  controllers  with  /3  and  /  parameters  interchanged. 


large  control  effort  that  drives  the  states  from  their  regulated  state  of  zero.  When  the  measurement 
noise  density  parameter  dominates  the  process  noise  density  parameter  in  constructing  the  Cauchy 
controller,  the  effect  of  measurement  outliers  is  mitigated,  while  still  responding  to  state  deviations 
due  to  process  noise.  In  the  Gaussian  simulation,  shown  in  Fig.  5(b),  both  controllers  perform 
identically. 


3  Conclusions 

The  current  results  of  state  estimation  and  control  in  linear,  discrete  time  systems  with  addi¬ 
tive  Cauchy  noises  are  summarized  in  References  [11, 12, 14  -  18,20,21,24  -  28,  Appendices  A-F], 
Although  closed-form  solutions,  especially  for  the  state  estimation  problem,  are  presented  in  those 
publications,  they  entail  a  significant  numerical  complexity  that  may  limit  their  real-time  applica¬ 
tion  in  practical  engineering  systems. 

The  estimator  complexity  stems  from  the  representation  of  the  associated  characteristic  func- 
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Figure  5:  Cauchy  and  Gaussian  controller  performance  for  two-state  system. 


tions  of  ucpdf  as  a  continuously  growing  sum  of  terms,  each  of  which  depends  on  the  entire  history 
of  the  measured  data  and  intermediately  computed  parameters,  as  can  be  seen  in  the  references 
and  summarized  briefly  in  Section  2.1.5.  For  scalar  and  two-state  systems,  compact  and  simplified 
representations  were  found  to  represent  those  characteristic  functions  [12, 16].  Moreover,  for  the 
scalar  system  case,  efficient  methods  were  found  to  prune  the  terms  of  the  characteristic  function 
while  bounding  the  error  of  the  approximated  result  [12].  However,  by  using  a  fixed  data  window, 
measurement  data  can  be  streamed  continuously  with  insignificant  error  for  the  two-state  system 
[17,  Appendix  C].  Although  the  determination  of  the  controller  requires  a  numerical  optimization 
procedure  [20,21,  Appendix  E  &  F],  the  Cauchy  controller  demonstrates  how  to  handle  outliers, 
an  issue  that  has  plagued  designers  who  use  Gaussian  control  algorithms. 
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MULTIVARIATE  CAUCHY  ESTIMATOR 
WITH  SCALAR  MEASUREMENT  AND  PROCESS  NOISES 

MOSHE  IDAN*  AND  JASON  L.  SPEYER + 

Abstract.  The  conditional  mean  estimator  for  a  n-state  linear  system  with  additive  Cauchy  measurement  and  process  noises 
is  developed.  It  is  shown  that  although  the  Cauchy  densities  that  model  the  initial  state,  the  process  noise  and  the  measurement 
noise  have  undefined  first  moments  and  an  infinite  second  moment,  the  prohahility  density  function  conditioned  on  the  measurement 
history  does  have  a  finite  conditional  mean  and  conditional  variance.  For  the  multi-variable  system  state,  the  characteristic  function 
of  the  unnormalized  conditional  probability  density  function  is  sequentially  propagated  through  measurement  updates  and  dynamic 
state  propagation,  while  expressing  the  resulting  characteristic  function  in  a  closed  analytical  form.  Once  the  characteristic  function 
of  the  unnormalized  conditional  probability  density  function  is  obtained,  the  probability  density  function  of  the  measurement  history, 
the  conditional  mean  and  conditional  variance  are  easily  computed  from  the  characteristic  function  and  its  continuous  first  and  second 
derivatives,  evaluated  at  the  origin  in  the  spectral  variables’  domain.  These  closed  form  expressions  yield  the  sequential  state  estimator. 
A  three-state  dynamic  system  example  demonstrates  numerically  the  performance  of  the  Cauchy  estimator. 

Key  words.  Cauchy  probability  density  function,  non-linear  estimation  with  heavy  tailed  noises,  characteristic  functions 

AMS  subject  classifications.  93E11,  62L12 


1 .  Introduction.  In  many  engineering,  economic,  telecommunications,  and  science  applications  the  underly¬ 
ing  random  processes  or  noises  have  significant  volatility,  which  are  not  captured  by  Gaussian  distributions  [12]. 
Rather  than  light-tailed  Gaussian  distributions,  heavy-tailed  distributions  have  been  shown  to  better  represent 
these  volatile  random  fluctuations.  Examples  are  radar  and  sonar  sensor  noise  [6]  and  air  turbulent  environment 
noise  [9],  Our  objective  is  to  develop  a  filtering  technique  for  linear  dynamic  systems  with  heavy-tailed  distributed 
noises  while  using  a  particular  case  of  symmetric  alpha- stable  (Sa-S)  distributions  [  1 0] .  In  its  simplified  form,  Sa-  S 
distributions  of  scalar  random  variables  are  characterized  by  their  characteristic  function  (p(v)  =  e~a  Ma,  where 
o  and  a  are  parameters  and  v  is  the  spectral  variable.  In  this  class,  a  =  1  and  2  yield  the  Cauchy  and  Gaussian 
distributions,  respectively.  For  a  e  (0,  2),  all  the  densities  have  infinite  variance. 

In  the  detection  of  a  radar  signal  in  clutter,  the  in-phase  component  of  radar  clutter  time  series  agrees  extremely 
well  with  a  Sa-S  probability  density  function  (pdf)  with  a  =  1.7  [13].  For  a  e  [1,  2]  a  maximum  likelihood  Cauchy 
detector,  which  is  in  the  class  of  myriad  filters  [1],  exhibited  performance  that  is  very  close  to  the  Cramer-Rao 
bound,  whereas  a  maximum  likelihood  Gaussian  detector  deviated  significantly  as  a  varied  from  2  to  1.  The 
myriad  filters  or  detectors,  based  on  a  cost  criterion  derived  from  the  a-stable  pdfs,  show  significant  improvement 
in  detecting  a  signal  in  heavy  tailed  noise  over  Gaussian  detectors  [8],  although  they  are  numerically  intensive. 
Techniques  for  enhancing  the  estimation  robustness  performance  of  Gaussian  filters  attempt  to  mitigate  the  effect 
of  outliers  due  to  impulsive  measurement  and  process  noise.  These  estimation  techniques  adjust  to  the  measurement 
data  by  incorporating  ad  hoc  heavy-tailed  densities  into  the  estimation  scheme.  For  example,  [?]  robustifies  the 
Kalman  filter  by  replacing  the  Gaussian  densities  with  heavy-tailed  symmetric  Student’  s-t  densities,  which  behave 
much  like  the  Gaussian  about  their  medians.  Robust  filters  with  data-dependent  mean-square  error  recursions  [7] 
have  similar  properties.  Fortunately,  our  recursive  estimator,  without  approximation,  contains  these  robustness 
features  and  have  motivated  the  derivation  of  a  sequential  non-linear  estimator  for  scalar  linear  dynamic  systems 
with  additive  Cauchy  process  and  measurement  noises  [3,4],  Although  Cauchy  noises  do  not  have  a  well  defined 
first  moment  and  have  infinite  second  and  higher  moments  [5],  the  conditional  pdf  (cpdf)  of  the  system  state  given 
the  measurement  history  was  determined  analytically  and  shown  to  have  well  defined  finite  conditional  first  and 
second  order  moments  [3], 

Unfortunately,  the  recursion  scheme  for  generating  the  cpdf  directly  for  a  scalar  linear  system  [3]  does  not 
generalize  for  the  vector  state.  In  [4]  a  similar  estimator  was  derived  by  generating  the  characteristic  function  of 
the  unnormalized  cpdf  (ucpdf)  in  a  recursive  scheme.  This  approach  is  somewhat  simpler  than  the  scheme  in  [3], 
allows  for  a  stronger  result  regarding  the  decay  of  the  estimator  parameters  with  time,  and  can  be  generalized  to 
the  multi-variable  case. 

The  generalization  of  the  characteristic  function  of  the  ucpdf  approach  [4]  to  the  multi-variable  case  is  the  essence 
of  this  paper,  which  is  organized  as  follows.  We  begin  by  formulating  the  estimation  problem  for  a  n-dimensional, 
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discrete -time,  linear  system  forced  by  scalar  Cauchy  process  noise  and  a  scalar  measurement  with  additive  Cauchy 
measurement  noise  in  section  2.  In  section  3  we  show  how  the  characteristic  function  for  the  ucpdf  of  the  system 
state  conditioned  on  the  measured  history  is  computed  sequentially  for  the  first  two  measurement  updates  and 
a  time  propagation  step.  This  sequential  exposition  suggests  the  general  form  of  the  characteristic  function  for 
the  ucpdf  given  in  section  4.  Fundamental  to  obtaining  this  sequential  estimator  is  the  closed  form  solution  to  a 
convolution  integral  that  is  given  in  Appendix  B  and  used  for  the  measurement  update  at  each  time  step.  In  section 
5  we  prove  that  under  a  mild  condition  on  the  system  input  and  output  matrices  the  characteristic  function  of  the 
ucpdf  is  twice  continually  differentiable.  This  yields  closed  form  analytical  expressions  for  the  minimum  variance 
estimate  of  the  states  and  the  estimation  error  conditional  variance.  In  section  6  a  three  state  system  is  used  to 
illustrate  the  performance  of  the  sequential  estimator.  We  offer  some  concluding  remarks  in  section  7. 

2.  Problem  Formulation.  We  consider  the  single-input-single-output  multivariable  linear  system 

(2.1)  xk+ 1  =  Q)xk  +  T  wk,  zk  =  Hxk  +  vk, 

with  state  vector  Xk  e  Rn,  scalar  measurement  Zk,  and  known  matrices  0  e  Rnxn,  T  e  Rnx1,  and  H  e  R1xn.  The 
noise  inputs  are  assumed  to  be  independent  Cauchy  distributed  random  variables.  Specifically,  Wk  is  assumed  to 
be  Cauchy  distributed  with  a  zero  median  and  a  scaling  parameter  yS  >  0.  Similarly,  Vk  has  a  Cauchy  pdf  with  a 
median  of  zero  and  a  scaling  parameter  y  >  0.  The  characteristic  functions  of  these  scalar  noises  are  assumed  to 
be  time  independent  and  given  by 

(2.2)  (pw  cpv  v=e~v^v^. 


These  characteristic  functions  have  a  scalar  argument  v:  The  initial  conditions  at  k  =  1  are  also  assumed  to  be 
independent  Cauchy  distributed  random  variables.  Specifically,  each  /-th  element  xm  of  the  initial  state  vector  xi 
has  a  Cauchy  pdf  with  a  zero  median  and  a  scaling  parameter  or,  >  0,  /  =  1, . . . ,  n.  The  characteristic  function  of 
the  joint  pdf  of  the  initial  conditions,  which  is  a  function  of  a  n-dimensional  spectral  variable  v  e  Rn,  is  given  by 

■3  (  „  \ 


(2.3) 


<Px,  v  = 


CfiVi 

II- 


exp 


Cti  \Vi\ 


=  exp 


i  i 

Pi\(ai,v)\ 


\ 


+j(b-\,  v) 


/=  i 


/=  i 


i=i 


The  last  form  was  introduced  for  notational  convenience  to  be  used  in  the  sequel.  We  used  the  definitions 
(2.4)  p1  =  at,  a1  =  eh  i=l,...,n,  tf  =  {0}n, 

/  /  i 

where  e /  is  a  n-dimensional  /-th  unity  vector  and  {0}n  is  n-dimensional  vector  of  zeros.  In  fact,  any  distribution  of 
the  initial  conditions  can  be  handled  by  the  derived  estimator  as  long  as  its  characteristic  function  is  of  the  form 
given  in  (2.3)  with  any  vectors  a1  and  b^,  and  parameters  p\  >  0. 

The  goal  is  to  compute  the  minimum  variance  estimate  of  Xk  given  the  measurement  history  or 

yk  =  zi  z2  ■  ■  ■  zk  . 

3.  Initial  Derivations.  The  method  proposed  to  solve  this  Cauchy  estimation  problem  entails  propagating  the 
characteristic  function  of  the  cpdf  of  the  state  vector  given  a  history  of  measurements.  Evaluating  this  characteristic 
function  and  its  derivatives  at  the  origin  of  the  spectral  vector  v  will  provide  the  desired  state  estimate  and  its  error 
variance.  The  characteristic  function  is  initiated  by  the  expression  given  in  (2.3).  It  changes  during  a  measurement 
update  when  a  new  measurement  is  processed,  and  during  time  propagation  affected  by  the  process  noise  input.  We 
begin  by  showing  how  this  characteristic  function  is  computed  for  the  first  two  measurement  updates  and  one  time 
propagation  step.  This  will  suggest  the  general  form  of  the  characteristic  function  and  hence  the  general  estimator. 

3.1.  Measurement  Update  at  k  =  1.  Given  the  initial  characteristic  function  of  the  system  state  (2.3),  we 
are  interested  in  computing  the  characteristic  function  of  the  initial  state  xi  conditioned  on  the  initial  measurement 
zi  =  Hx i  +  vi.  The  desired  characteristic  function  is  given  by 

oo  oo 

(3-1)  <Px1|z1  v  =  ...  fXi|Zi  xi|zi  eJV  Xlcfxi, 


where  the  measurement  history  is  simply  yi  =  z i  and  the  cpdf  fx:  \z1  xi  |zi  is  computed  as 


fxi  I Z-1  Xi  I  Zi 


fx1Z1  X-|,Z-|  fZi  |Xi  Zi  |  Xi  fXl  Xi  fv  Zi  -  Hx  1  fx,  Xi 
fZi  Z1  =  fzi  Z1 


(3.2) 


fz,  z  1 
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with 

(3.3) 


Hence, 


(3.4) 


oo  oo 


OO  OO 

fz,  Zi  =  ■■■  fxiZi  x-\,z-\  dx i=  ...  fv  z\  —  A7xi  xi  c/xi. 


—  OO  — oo 


-oo  — oo 


oo  oo 


cpXl  \Zi  V  =T — —  ■■■  /v  zi  -  Hx  1  fx,  XI  e 


—  oo  — oo 


The  subsequent  derivations  can  be  simplified  by  avoiding  the  division  by  fz,  zi  in  (3.2)  and  (3.4).  Thus  we 
use  the  unnormalized  cpdf  and  its  characteristic  function  defined  by 

(3.5)  f  *i|zi  =  fx  \z  xi\zi  fz  zi  =  /V  z^  -  Hx^  fx  x^  , 

X,\Z,  11  1  1 

and 

OO  oo 

(3.6)  '  xi |Zi  ^  =  ■■■  fv  z^  -  Hx^  fx i  xi  elvT*'dx-\. 


When  the  actual  (normalized)  functions  are  needed  for,  e.g.,  computing  the  state  estimates,  the  normalization  factor 
can  be  easily  determined  by  evaluating  cp  xi  |zi  at  v  =  {0}n,  i.e.,  fz  zi  =  cp  {0 }n  ■ 

Xi|Zi  1  Xi  |Zi 

The  integral  in  (3.6)  resembles  a  Fourier  transform  of  a  product  of  two  functions.  Using  the  dual  convolution 
property  of  Fourier  transforms,  it  is  shown  in  (A. 7)  of  Appendix  A  that  for  a  scalar  measurement  zi,  and  thus  a 
scalar  function  fv  ■  ,  the  integral  in  (3.6)  can  be  expressed  as 


(3.7) 

where  cpv  ■  and  <px, 
can  be  restated  as 

(3.8)  f  v 

Xi  |Zi 


1  x  |z,  v  =  2^  <Px,  v-HTq  cpv  -n  eiz'ndr], 

—  OO 

were  defined  in  (2.2)  and  (2.3),  respectively.  Using  these  expressions,  the  integral  in  (3.7) 

oo  f(  \  I 

=  exp  -  n  p'\(a\v-HTn)\  +j(b\v  -  HT  q)  ~  Y\n\ +jz  n  dq 


Iff 

—  OO 
1 

ei(b  i.4 
2rr 


/=  i 


\ 


exp 


p^\(a\v)  -  Ha^ql  -y\q\+j  z\  -  Hb]  q  dq. 


/=  i 


This  integral  is  solved  using  the  methodology  presented  in  Appendix  B.l.  For  that,  the  coefficients  of  q  in 
the  absolute  value  term  of  (3.8)  have  to  be  normalized  to  one.  Clearly,  this  cannot  be  attained  if  some  of  these 
coefficients  are  zero.  Hence,  first  we  assume  that 


(3.9)  Ha]  /=(),  i=  \  , ...  ,n, 

f  I 

or  equivalently  that  all  the  elements  of  H  =  h-\  /?2  •  ■  ■  hn  are  nonzero.  With  this  assumption,  (3.8)  is  restated 

as 

j(b\v)  °°  ^  n+1  ^ 

(3.10)  g>  v  =  i  exp  p,v  q+j(q  dq, 

P 

XiIZi  2tt  i\(  /  )~  I  i 

-oo  /=1 

where  we  defined 

(3.11a)  p,  =  p1 1  Ha1 1 ,  p,  =  a\/(Hav), 

(3.11b) 


Pn+1  = 


Pn+1  = 


/=  l,...,/7, 
Ci=zi  -Hb\. 
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Using  the  results  in  (B.9)  and  (B.10),  while  associating  £;  and  z  in  the  later  with  (pi,  v)  and  Cp  respectively, 
the  integral  in  (3.10)  is  evaluated  as 

□  □  □  □ 


(3.12) 


% 

X!  [Z 


n+1  -iM  □  n+'\  □  □  n+ 1  □ 

v  =ei(b  Sf  psgn(p  p ,  v  )  exp  p  P  P ,  v  +y<f  P  ,v 


r  -  /  ;□  □- 


if  / 


^  Q1|1  will  111 

/  exP  ye/00  . 


Ill 

where  the  coefficient  functions  gf/  (■)  are  given  by 


(3.13) 


S'1'1  y1 1 1  (tz)  +  p+y1M(\/)  yf  p+y1M(v) 

f  gi  1  1  gi  -  i  -  /  gj 

ITT 


1  1 

The  arguments  of  gf  .  (■)  and  the  exponents  in  (3.12)  are 


(3.14a) 


t/^i  ( v)  =  P  sgn  (fp  -  Pi,  v))=  p  sgn  (fa,  ,  i/j) , 


(3.14b) 


(v)  =  -  P  \(P  ~ Pi,  v)  |  +j( <fi pi  +  b\v)  =  -  p  | (a-, ,  v)  |  +VW11 ,  vj. 


where,  while  using  (3.11),  for  £/=  /  we  have  defined 

u  a1  a,1  //=  n  +  1 

Hfll  Ha1  £/=  n  + 


zi  -  Hbp 


a/  =p  -pi= 


i=n+l  ,  bj  =  £-ipi  +  rf  = 


£-  n  +  1 


'i  +/?!  i/=n+  1 


/  =  n  +  1 


To  simplify  the  notation  and  subsequent  derivations,  specifically  to  avoid  the  £  /=  /  exclusion  in  the  sums 
of  (3.14),  the  elements  in  these  sums  are  renumbered  sequentially,  i.e.,  £  e  [1,  n]  for  each  /- th  term.  The 

renumbered  vectors  a/  will  be  denoted  by  a  .To  accommodate  subsequent  derivations,  the  coefficients  p  in 
the  sums  of 

(3.14)  will  be  marked  differently  for  y  (■)  and  y  (-):  we  will  use  q  for  the  former  and  p  for  the  latter.  The 

gi  e/m  i|i 

parameters  z  and  p/  in  (3.13)  will  be  denoted  by  c ,  and  c/,-  ,  respectively.  Finally,  the  number  of  terms  in  the 
sum  of  (3.12)  will  be  denoted  by  fT  =  n  +  1,  while  the  number  of  elements  in  the  sums  of  (3.14)  will  be  marked 

,  i |i  t  1 1 1 

Dy  '  ei  =  n.  Although  at  this  stage  all  the  counters  nei  are  the  same,  as  will  be  seen  in  the  sequel,  they  may  be 
different  for  different  /- s.  Therefore  we  have  introduced  /-dependent  element  counters  n  .  Moreover,  although  the 

1  1 1  1  |  1  r  e/ 

counters  n{  and  n  seem  to  be  related  to  each  other  (the  former  is  n+  1  while  the  latter  are  n),  they  will  exhibit 
irregular  changes  and  thus  were  introduced  separately. 

With  these  notations  and  renumbered  parameters  and  vectors,  cpX:  |Zl  v  is  restated  as 


(3.16) 


i/  ="£  g1M  y1M(\/)  exp  y1!V)  , 


where 

(3.17) 


=1  j ^'Wyli'M  " 

and  h  g*  y2 


Jli  c/1 1 1  +y1 1 1  (tz) 


1/.1  n 
yfl\v)  =  qr1 1 1  sgn  fa1'1,  v)  ,  y1M( v)  =  -  p1 11  Ifa1  l\ v)  \  +j(b^ M,  v). 


(3.18) 
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It  is  interesting  to  note  that  the  initial  characteristic  function  in  (2.3)  is  also  expressed  in  a  form  identical  to  that 
given  in  (3.16).  The  initial  parameters  are:  n1  =  1;  g 1  =  1  (and  thus  there  are  no  y1  i  arguments);  n1  =  n\  the 

parameters  p1  and  the  vectors  a1  and  b 1  are  given  in  (2.4). 

/  /  i 

3.1.1.  Special  Case:  Ha 1  =  0.  If  one  or  more  of  the  Ha1- s  are  zero,  i.e.,  one  or  more  of  the  entries  of 

'  1  1  ' 

the  matrix  H  are  zero,  then  the  corresponding  terms  would  come  out  of  the  integral  in  (3.8).  However, 

the  other  manipulations  used  to  solve  this  integral  would  remain  the  same.  For  the  sake  of  simplicity,  this  is 
demonstrated  by  assuming  that  only  Ha1  =  0,  while  Ha1  /=  0  for  /  =  2,  ■ . . ,  n.  In  this  case,  (3.8)  and  (3.10)  are 

i  / 

restated  as 


(3.19) 


Xil  Z,  V  - 


p1  a\v)  \+j(b\v) 

e~  i  K  i _ 1 


\ 


2  TT 

1  1  1 
e-Pi  |('a1,vj|+/pb1,vj 

2rf 


exp 


p]\(a),v)  -  Ha1/?! 


+y  zi  —  Hb\  n  drj 


—  oo 
oo 


/=  2 


\ 


V  /7+1 

exp  _  p,irp/,i/; -a? |  +fan  drj, 
/=  2 


where  the  parameters  p/,  p/,  /  =  2,  ■  ■ . ,  n,  and  <fi  were  defined  in  (3.11).  The  integral  in  the  above  is  solved  using 
the  same  technique  as  in  the  case  when  the  condition  in  (3.9)  holds,  yielding  the  result 


(3.20) 


n+ 1 


=  e-^:,v)\  9  (yj 


HU,,  (v)) exp (y  (V)), 


/— 2 


111  111 

where  g  (•)  are  given  in  (3.13).  The  arguments  of  g  (■)  and  the  exponents  in  the  sum  of  (3.20)  are 


(3.21a) 

(3.21b) 


ygi(v)  =  p  sgn  (( a , ,  v))  =  g1 1  ^gn 

/ 

=2  =2 
/=/ 

n+1  in 

ye/(i/)  =  -  P  Ifa, ,  i/J  |  +j(bn\  v)  —  — 

/ 

=2 

/=/ 


(a1|V;  , 
/ 


p111  fa111,  i/J  +/b1M,  1/J, 
/  /  / 

=2 


with  vectors  a/  and  b ,  defined  in  (3.15).  When  ordering  the  elements  sequentially,  thus  avoiding  the  £/=  / 

111  111  111  111 

exclusions  in  the  sums  of  (3.21),  we  obtain  the  parameters  q  ,  p  ,  and  rectors  a  and  b 

^hp  result  in  (3.20)  can  be  cast  in  the  form  of  (3.16)  by  the  following  modifications.  First,  the  exponent 
e~Pi  \(a-\’v)\  outside  of  the  sum  is  combined  with  the  exponents  in  the  sum  by  modifvjng 1  die  argumentVe/fv),  i.e., 
adding  one  more  element  in  the  sum  of  (3.21b).  This  element  is  denoted  by  —  P-,  (a^  ,v)  with  P-/  =  p1  and 

a111  1 

i  =  a)  for  all  £.  Consequently,  the  sum  that  defines  the  argument  to  the  exponents  will  have  n  elements.  Second, 

1  I  1 

for  consistency,  the  sum  of  (3.21a)  is  expanded  by  one  by  introducing  q  =0  for  all  £.  Finally,  the  terms  in  the 
sum  of  (3.20)  are  ordered  sequentially,  starting  from  one.  As  a  result,  cpx 1  |Zl  v  for  this  special  case  is  expressed 

in  a  form  identical  to  that  of  (3.16)  when  condition  (3.9)  holds,  with  the  only  difference  being  that  n  t  is  reduced 
111  1 
by  one  to  n  =  n.  The  manipulations  above  can  be  extended  to  the  cases  where  more  than  one  Ha1  ^s  zero,  hence 

t 

further  reducing  the  number  of  terms  used  to  express  cpXl  \z1  v 

3.2.  Time  Propagation  to  k  =  2.  Given  the  time  propagation  equation  X2  =  Oxi  +T  w/i  we  want  to  compute 
the  characteristic  function  of  the  ucpdf  of  X2  given  zi.  Using  the  results  of  Appendix  C,  (C.4),  that  characteristic 
function  is  given  by 


(3.22)  cp  v  -  0Tv  <pw  TTv  - nt  g111  yl|1(07V)  exp  y^\oTv)  e 

(p 

X2IZ1  Xi|Zi  /  gi  ei 

i=  1 
1/1 

_  nt  0i|i  win  w 

/  g/(Ori/)  exp  ye,.(0Ti/)-/3 \(T,v)\  . 

i=  1 


The  last  expression  in  (3.22)  can  be  expressed  in  a  form  that  is  similar  to  the  one  given  in  (3.16).  For  that,  we 
make  the  following  observations  that  lead  to  the  definitions  of  new  and  time  propagated  parameters  and  vectors. 
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-  —  211111 

1 .  cp  i/  is  expressed  with  the  same  number  of  terms  as  (p  v  .  Hence,  we  define  n  =  n  . 

X2 1  Zi  1  ^  Xi  |  Zi  1  ^  1  j  1  t  t 

2.  The  coefficient  functions  g  .  (■),  or  specifically  the  parameters  c  .  and  d .  ,  remain  as  in  (3.17),  i.e., 

_2|1  1|1  2 1 1  '|i  _  '  2  1  '  '  1 1 1 

/  =  c i  ,  d,  =  d,  ,  i  e  [1, . . . ,  nt  ].  There  are  only  changes  in  the  arguments  of  g,  (■)•  For  notational 

211 

consistency  we  will  denote  the  updated  coefficient  functions  g  /  (■)  while  remembering  that 


(3.23) 


g2|1(WM(-). 


Ill  1  I  1  -|  M  1  M  2|11|1 

3.  The  parameters  g  and  p  used  to  define  y  (-)andy  (■)  in  (3.18)  are  also  unchanged,  i.e.,  p  =P  , 

'  i|i  91 

<7,  =qt  ,  i  e[\,.  nt  l  £^A\,...,neil  in  m 

4.  The  arguments  of  y  (■)  and  y  (■)  are  multiplied  by  O hence  affecting  the  vectors  a  and  b  as 


(3.24a) 


11  ei  ei  2  1 

1/yi(<t>7V)  =  q  1 1 1  sgn  (a,  ,®Tv)  ql11  sgn  v)  q2|1sgn  (a,-  >  W 


(3.24b)  ^(<S>Tv)  =  -  p^\(<S>a^\v)\+j(0b^,v)  =  -  \(af\  v)\  +j(b^\  v). 


Here  we  used  the  definitions  a211  =  Oa1  b211  =  Ob1  /  e  [1, . . . ,  a211],  £  e  [1, . . a1 11], 

/  /  /  /  if  e/ 

5.  The  exponents  are  a  function  of  an  additional  elenreqt  —  /3J  TT ,  1/)  | .  Hence,  the  number  of  elements  that  define 
the  new  argument  y  1  (■)  increase  by  one,  i.e.,  n  =  a1  n  + 1.  The  parameters  and  vectors  that  define  these 

ei  ei  ei 

new  elements  are  P2j\n  =  P-  =  T ,  i  e  1 ]■  For  consistency,  and  to  facilitate  the  subsequent 

impipulations  of  the  characteristic  function,  the  number  of  elements  in  |^e  sum  of  the  new  ^l'guments 
g/  (■)  are  also  increased  by  one,  while  introducing  the  zero  parameters  q  ^  =  0,  i  e  [l, . . . ,  nt  ].  Hence, 


Qj  \  )  cue  cu^u  meicctocu  uy  unc,  winic  liiiiuuueiiig  uic  z,ciu  jjcu cuncicib  <-/  a±i  - 

^  2|1  2|1  "'ei 

with  these  new  elements,  the  arguments  y  (■)  and  y  (■)  are  defined  as 

gi  ei 


(3.25a) 


^{v)=  q2 11  sgn  (a2\\v) 


(3.25b) 


y2|  1  n  2 1 1  2 1 1  2 1 1 

ei  (V)  =  yli \(bTv)-p\(r,v)\  =  -  Pi  I  (a,  ,v)\+j(bi  ,v). 

=1 


Using  all  the  time  propagated  and  newly  defined  parameters,  (3.22)  is  restated  as 


(3.26) 


where 


(3.27) 


q p  v  ="£  g2M  y2|V)  exp  y^V)  , 

X2IZ1  /  gr/  e/ 

/=  1 


9211  y21  V)  =  1 


ycf’+df+y'V) 


W  _d2|1+y2|V) 

/  /  g/' 


with  arguments  y2  (■)  and  y2  (•)  given  in  (3.25).  Overall,  we  have  obtained  a  form  which  is  similar  to  the  one  in 

gi  ei 

(3.16),  determined  after  the  first  measurement  update  in  the  previous  subsection. 

Remark  3.1.  It  may  so  happen  that  for  a  given  i  several  vectors  a211  with  £  c  fl, . . . ,  n2  |  could  be  co-aligned. 

For  example,  if  there  are  two  such  vectors  a211  and  a2}\  £/=  £  they  are  related  by  a  non-zero  constant  9  ,  i.e., 
a2-  =  9  a2!1,  which  implies  that 


(3.28a) 


sgn  (a_,v)  =  sgn(9  )sgn  fa2,1,vj 


\(ar,v)\  =  \9  \-\(a,  ,  v)\. 


(3.28b) 
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Consequently ,  in  the  sums  of  (3.25)  such  two  terms  can  be  combined  as 

(3.29a)  ^iisgn  ($l\v)  +q2j^sgn  faUf  v)  =  cfj1sgn  (e%\\v) 

2|1  2 1 1  2 1 1  2 1 1  2 1 1  2 1 1 

(3-29b)  /  I  fa,  ,v)\+  p~  I  (a-  ,v)\  —  pi  \(a,  ,v)\ 

with 

2|1  2 1 1 

(3.30a)  =q  +Sgn(6)q~ 

(3.30b)  P  =Pi  +|e  | p-, 

thus  reducing  n2e]\  the  number  of  elements  in  the  sums  of  (3.25),  by  one.  If  there  are  more  than  two  co-aligned 
vectors  a,211,  or  if  there  are  several  groups  of  such  co-aligned  vectors,  the  above  procedure  can  be  repeated  for  all 
those  occurrences  thus  further  reducing  n2^. 

In  the  sequel  we  will  assume  that  all  co-aligned  vectors  were  combined.  This  minimizes  the  number  of  elements 
needed  in  (3.25)  and  contributes  to  the  efficiency  of  the  resulting  estimator. 

3.3.  Measurement  Update  at  k  =  2.  In  this  subsection  we  perform  a  second  scalar  measurement  update  at 
/c  =  2  using  the  second  measurement  22  =  HX2  +  V2.  Specifically,  we  will  compute  <Px2 1 y2  v  ,  where  y2  =  zi  Z2  . 
Using  the  general  convolution  results  of  (A.7)  in  Appendix  A, 

oo  ^  oo 

(3.31)  <X2|y2  v  =  jfT  >  Xz\ z,  v-HTn  (pv  -n  e/Z2ndn=2ff  <  x2|Z!  v  -  HT q  exp  -y\n\  +jm  dp, 

—  OO  — oo 

where  from  (3.26) 


(3.32) 


cp  v-H'n  =  g 

X2\Zi 

/■=  1 


y2|1 


v-HTn  exp  y211  v-H'n  , 

ei 


gi 


g 


,2H 


/  (■)  is  given  in  (3.27),  and  from  (3.25) 

2 /I 

(3.33a)  /M  v-HTn  =  q2M  sgn  (a2{\v  -  HJ  q)  , 


gi 


=i 


(3.33b) 


y 2,1  v  -  Ht  q  =-  p2|1|(a211  ,v  -  HT  q)\  +j(b2^  ,v  -  HJ  q). 


=i 


The  integral  in  (3.31)  is  solved  using  the  methodology  presented  in  Appendix  B.2.  For  that,  the  coefficients 
of  n  in  the  absolute  value  and  sign  terms  in  (3.31),  or  more  specifically  in  (3.33),  has  to  be  normalized  to  one, 
assuming  those  coefficients  are  not  zero.  Hence,  we  first  assume  that  F/a2/1  /=  0,  V  i,£.  The  case  that  several 
l_la3\ i=  0  will  be  addressed  at  the  end  of  this  section.  Next,  we  perform  the  following  manipulation  and  introduce 
the  intermediate  variables 

2 1 1  T  2 1 1  2 1 1  2 1 1 

fa,  ,v  —  H  q)  =  (a,  ,  v)  -  Ha,  q  =  Ha ,  (fp,  ,v)-q), 

^l^gn  fa21 \v-HJq )  =  Q2 1 1  sgn  (fp/  ,v)  -  q), 

n2|1  2 1 1  T  2 1 1 


(3.34a) 

(3.34b) 


(3.34c) 

where 

(3.35) 


/  I  fa,  ,v  —  H  q)  |  =  pj  \(pi  ,v)  —  q\, 


/ 

Pi  =  a2'1  Ha2'1  ,  =  qf"sgn  Ha""  ,p‘"=p‘"\Ha 


■>2|i 


2M  _„2|1|Ha2|1| 


Using  these  variables,  (3.33)  is  restated  as 


(3.36a) 


y2'1  v  -  HT q  =  ' Q2 11  sgn  (fp/  ,v)  —  q), 


gi 


n2 1.\ 


y211  v  -  HTq  =-  p2|1|fp/,i/J -q\  +j  (b2]\v)-Hb2]'q 


(3.36b) 


= i 
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Substituting  (3.32)  and  (3.36)  into  (3.31)  yields 

oonT  o_y?  ! 


(3.3*7)  cp  % 

XMV=Tt,  ,-,9' 


,2|1  □“  q2|  sgn  {(p  ,  v)  -  r\)  { 


-  „|/;1 

x  exp  -  -  p2^\(pi,v)  -ri\  +j  (b2l\v)  -  Hb2l]ri  3eXp  -y\n\+jz2n  dp. 

/  /  / 

=  1 


Combining  the  exponents,  while  interchanging  the  integral  with  the  summation,  (3.37)  is  restated  as 


(3.38) 


x2\y2 


nV'  j(b2'\v)  °°  D<'1+1  C  C  "2i1+1  9|1 

e  L  g,2|lD  Q  1  sgn((ju/  ,v)  -  /7)n  expD-  pf  \(Pi  ,v)-n\+Km  dr\. 


v  = 

i=  1 


=  1 


=  1 


In  the  above  we  defined  the  new  parameters 

(3.39)  p2|1  =  y,  Pi  =  {0 }n,  C/  =  Z2  -Hb2]\  £=d2|1  +  1. 


2 1 1 

In  addition,  for  consistency,  the  member  of  the  elements  in  the  argument  of  the  functions  g  ,  (■)  was  increased  by 

one  by  introducing  Q  =  0 ,  £  -  n  +1. 

/  e# 

Each  integral  in  (3.38)  has  the  form  of  the  general  integral  in  (B.  11),  when  associating  £  and  z  with  (pi  ,  v) 
and  0,  respectively.  Their  solution  was  derived  in  (B.15)  of  Appendix  B  and  is  given  by 


□ 


(3.40)  cp 


x2\  y2  v 

□ 

X  exp 


nW  +  l  n2|1 


g212 

^  i  m  u 


n2'U  1 


Q^'s. gn(P  _p  '  p2|1sgn(//  _p  ,v 

i  (  /  im  J  /  (  /  /m  J  □ 

=  1  =1 

/=m  □  /=m 


i=1 


n2/1  . 


=  1 
/=m 


m=  1 


7 


2/1  n2/1 


P2 ' 1  I  (Pi  ~  Pim,  V)  |  +j(i(, pim,  V)  "  = 


□ 


/o 


>2 i 2  n>|2  (p),y212  (\/)  exp  y212  (v)  . 

im  y  gim'\  gim2  eim 


i=  1  m- 1 


2 12 

The  coefficient  functions  gim  (-,  ■)  that  have  two  arguments  are  given  by 


(3.41) 


212  212  212  1  nCd211  y212  W  +  Q2'1  g211  y2|2  (l/)  “  Q 2 II 

g  yr'  (v),y  (V)  =  D  y  ygim  1  im  _  ^ gimA  im 


im  gim 1  gim2 


2TT  K  ,+p2'1  ,  2'2 


2ft - 2f2- 


□  . 


+  ^  /m  +yg/m2('/)  _  P/m  +  yg/m2('/) 


The  arguments  of  these  coefficient  functions  and  those  of  the  exponents  in  (3.44)  are  given  by 


(3.42a) 


(3.42b) 


(3.42c) 


1/grt?n2('/) 

y2|2 


,«1+1 

n2/1  +1 
ez 

Q2|1sgn((p; 

-  Pim,  V))  = 

Q.  sgn  {(aim  ,  v)), 

=i  7 

=i  7 

/=m 

/=m 

,2)E1  +  1 

n^+l 

ei 

p2|1sgn((p/ 

-  Pim,  V))  = 

p2. 11  sgn  (.(aim  ,  v)), 

=i 

=i 

/=m 

/=m 

n2l'+ 1 

n2,1  +  1 

e‘  2 1 1 

2 1 1 

2 1 1 


{v)  —  Pi  |  (pi  pim,  v)\  +  j(£iPim  +  bj  ,v)  —  Pj  \(aim,v)\  +j(bim,v), 

=  1  =  1 


/=m 


/=m 
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where,  while  using  (3.35)  and  (3.39),  for  £/=  m  we  have  defined 

□ 

,2M  o2|  1 


_na 


□  ' 


Ha 


2|1 


Ha„ 


2 1 1 


m/=n  2 1 1 

I/1  +  1>  £/=  nei  +  1 


(3.43a) 

S//77  —  l^i  1-1  im  — 

D  a2;1 
— 

m  =  n2' 1  +  1 

B  uJH 

Haim 

9M 

£  =  n21.1  +  i 

"  Z2-H6211 

Q  • 

+b211 

+  1 

(3.43b) 

bim  —  +  b  ^  — 

62'1 

' 

ei 

2 1 1 

/ 

m  =  nei  +  1 

To  avoid  the  double-summing  over  /  and  m  in  (3.40),  all  the  (-)/m  terms  are  now  re-ordered  sequentially  with 

A#1 

■  212  />  211 
one  index  /.  Its  range,  or  the  number  of  terms  in  this  single  sum  will  be  n  ,  =  (n  ■  + 1 ).  In  this  reordering, 


211  2  1 2  ^ 

one  has  to  keep  track  which  parent  g  .  (■)  is  used  to  compute  the  updated  g  [  (-)•  This  will  be  done  by  storing 

the  parent-term  index  in  the  variable  r  1  .  In  addition,  the  offsets  ±Q  .  used  to  compute  g  .  (■)  in  (3.41)  will 

be  denoted  by  h  1  .  The  imaginary  parameter  £/  and  the  real  parameter  p  |  in  the  denominator  of  (3.41),  when 
/  212  212  o  12  212 

re-ordered  sequentially,  will  be  denoted  by  c  and  c/  ,  respectively.  The  re-ordered  arguments  y  (v),  y  (v), 

o  |2  212  2\i  '212  9'm1  9'/m2 

and  y  ((/)  will  be  denoted  by  y  ((/),  y  (i/),  and  y  (v),  respectively.  These  arguments  are  defined  in  (3.42)  by 

eim  gi  1  g/2  ei 

sums  that  have  the  £  /=  m  exclusion.  To  avoid  that,  the  parameters  in  these  sums  will  be  re-ordered  sequentially, 
while  accounting  also  for  the  sequential  (/,  m )  ordering  discussed  above.  The  number  of  terms  in  those  sums  will  be 

denoted  by  n  '  The  re-ordered  vectors  a/m  and  b  \  will  be  denoted  by  a  .  and  b /  ,  respectively.  The  re-ordered 
parameters  Q™  of  (3.42a)  and  p.  of  (3.42b)  will  be  denoted  by  q2  2  and  q2  2 ,  respectively,  while  p2  1  of  (3.42c) 

•  1  i  1  /  2  / 

2  2 

will  be  marked  by  p  7  . 

With  these  substitutions,  (3.40),  (3.41)  and  (3.42)  are  restated  as 


(3.44) 

n2J2 

g2'2 

y2|V),y2|V) 

exp  y2|2(i/)  , 

x2|Y2 

/=  1 

/ 

g/i  g/2 

ei 

□ 

□ 

(3.45) 

g2'2  Z'WV) 

1  □ 

r0221 

11  /'V)^212 

+  y  (V) 

y/5i  -t? 

/' 

1  y  V  / 

+  c/ 

+  y  (i/) 

i  gi  1  g/2 

277  - 

yc2'2 

CM  ! 
CM 

CM 

CM  ' 

2 1 2  2 1 2  2|2  [ 

/  g/^ 

'  ‘  g/2 

and 


(3.46a) 


(3.46b) 


n2J? 


, ( W  =  qf  fsgn  Ca2 12,  y; 

y2|2 


=  1 

2 

A7^ 


2 1 2  2 1 2 


&v)  =  qj'z  Sgn  , 


=i 


2 1 2 


00  =  -  Pi  ( a ;  ,v)  +j(bf  ,v). 


212 

Note  that  the  coefficient  functionsrP  (■,  *)  have  ttyo  arguments.  It  is  grouped  into  one  vector  argument  with 

two  components  denoted  by  y2|2(v)  =  y21  (v)  y2|2(v)  7  .  It  is  defined  by  combining  the  two  equations  of  (3.46a) 

gi  gi  1  g/2  r 


212  212 

while  using  a  two  dimensional  vector  of  parameters  q  -  q  ' 


are  restated  as 
(3.47) 


/ 1 


\T 

2\2  .  With  these  definitions,  (3.44)  and  (3.45) 

w  2 


c p  v 

x2\Y2 


=  '  g212  y2[2(v )  exp  y2|2(i/)  , 


9' 


1=1 
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g2|2  2,2  =  1  □  9%  fj«(v)  +  h2 j2  _g^2 

i  gi  iff-  -0r \2 - 2f2 - -  — m - 2+2 - 2+2 - 

7  /  +d;  +yg/2('/)  jC/  ~d,  +ygi 2(v) 


where  y21^-)*  t-  1,2,  denotes  the  f-th  element  of  the  vector  y2  2(-)-  Similarly,  (3.46)  are  restated  as 
gtf  g/ 

n2/2  2/2 

(3.49)  y2|V)=  V|2sgn  (a2'2,v)  eR  2,  y21  V)  =  -  ?|2|fa2'2,  vj|  +/fb212,  v* 


Remark  3.2.  A/ote  that  the  coefficient  functions  g  1  (■)  and  thus  their  parent  functions  g  1  (*)  are  computed 

i  2/2 

using  the  updated  arguments  y2|2( v )  of  (3.49).  Hence ,  especially  when  evaluating  the  parent  g2|1\-)>  one  does  not 

Q1'  £-2/2 

use  the  old  arguments  y2^(v)  of  (3.25),  but  rather  the  updated  ones  of  (3.49)  with  the  offsets  tr’'2. 

gi  i 

Remark  3.3.  Due  to  (3.23)  one  could  restate  (3.48)  as 

(3  50)  O212  V2|20/1  1  a9l'’2  y2Ji('/)  +  /?2'2  ~g% 

(3.50)  g  y  (v)  = _  ri  v-'  +y  lw^  jfc  _ □ 

i  +d  +y  (y)  ’ 


2n  □  /c2!2  212  2|2 

/  g/2 


2|2  2 | 2 


which  implies  that  one  would  have  to  store  only  the  parameters  c'\'j2  and  c/1^  of  the  measurement  updated  coefficient 

'  i  ^  i 

function  g111  (■)  and  not  the  time  propagated  ones  (which  are  in  fact  the  same.) 

rZI2 

Remark  3.4.  Similar  to  the  discussion  presented  in  Remark  3.1,  for  a  given  index  i,  several  vectors  a2!2  with 
2 1 2  2 1 2  1 
£  c  [1  ,...,nei]  could  be  co-aligned.  In  such  cases,  in  the  sums  of  (3.49)  the  elements  with  co-aligned  a,  -s  can  be 
combined  using  the  steps  presented  in  Remark  3. 1,  thus  reducing  the  number  of  elements  in  those  sums,  i.e.,  reducing 

nl\2.  For  numerical  efficiency,  we  assume  that  all  those  co-aligned  directions  are  combined,  hence  minimizing  the 
number  of  elements  a2.12  in  the  sums  of  (3.49). 

eix 

Remark  3.5.  The  procedure  for  handling  the  special  case  when  one  or  more  of  the  Ha2'/]-s  are  zero  is  similar 
to  what  was  presented  in  subsection  3.1.1.  The  details  are  not  presented  here  for  brevity. 

212  rr 

In  summary,  the  measurement  updated  1  y2  v  ^of  (3.44)2js  expressed  by  n^  terms.  The  coefficient  functions 

(■)  are  defined  in  (3.50)  by  the2set  of  parameters  c,-  and  c/(-  ,  the  offsets  h,  ,  and  the  indqx  r,  of  the  parent- 
term.  Alternatively,  the  index  r'  can  be  replaced  by  storing  the  parameters  that  define  g  (■),  i.e.,  c  and 

/  2/2  r2l2 

111  2 |2  *  i  ‘  i 

d  .  The  two  dimensional  vector  input  arguments  of  g  (*)  and  the  scalar  arguments  of  the  exponentials  in  (3.44) 
2/2  / 

'  i  2 1 2  2 1 2 

are  determined  by  (3.49)  while  using  the  measurement  updated  vectors  a  and  b  ,  together  with  the  vector 
212  ,  ,  212  i  i 

parameters  q  and  scalar  parameters  p 

/  / 

3.4.  Summary  of  Initial  Results.  The  above  derivations  demonstrate  that  the  characteristic  function  of  the 
ucpdf  of  the  state  Xk  at  time  steps  k  =  1  and  2  is  expressed  as  a  sum  of  nt  weighted  exponential  terms.  The  number 
of  terms  increases  during  a  measurement  update  and  are  unchanged  during  a  time  propagation  step.  In  each  such 
term  /,  the  exponents  and  their  weights  or  coefficients  are  functions  of  a  sum  of  nei  elements.  The  number  nei  of 
elements  normally  increases  during  the  time  propagation  step  and  are  unchanged  during  a  measurement  update. 
(In  special  cases,  the  number  of  elements  may  be  reduced  -  see  Remarks  3.1  and  3.4.)  Moreover,  it  is  observed  that 
the  scalar  coefficients  of  the  exponents  are  functions  of  a  vector  of  parameters.  The  dimension  of  these  vectors  are 
unchanged  during  the  time  propagation  step,  and  increase  by  one  each  time  that  a  new  measurement  is  processed. 
Those  observations  provide  the  insight  and  guidance  on  how  to  construct  the  characteristic  functions  at  any  time 
step  k,  as  is  discussed  next. 

4.  Time  Propagation  and  Measurement  Update:  General  Case.  The  initial  results  presented  in  the 
previous  section  suggest  the  general  form  of  the  characteristic  function  of  the  ucpdf  of  the  state  at  any  time  step 
k  given  past  measurements  up  to  time  k  —  1  and  k.  Specifically,  based  on  the  measurement  updated  characteristic 
functions  obtained  in  (3.16)  and  (3.44)  for  k  =  1  and  2,  respectively,  we  assume  now  that  at  any  time  step 
the  characteristic  function  of  the  ucpdf  of  the  state  xw  given  all  the  data  history  up  to  that  time,  i.e.,  yk  = 
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Zl  Z2 


Zk  ,  is  expressed  as 


(4.1) 
where 

(4.2) 

and 

(4.3) 


k/k 

cp  v  gk]k  yk]k(v )  exp  yfc|  V)  , 

Xk\Yk  i  gi  ei 

/=  1 


9<  u.- ( *  =  2f 0  V;'Ud.  ■  ‘ 


r  +y  00 

/  g/2 


J° 


d«*  +/J4M 


/'>)  =  V'An  €R>,  /'»(*)=-  Vl‘lfa*l‘,WI+/f‘l‘.^ 


9' 


=  1 


Here,  y^V)  and  qk^k  are  /f  dimensional  vectors.  When  evaluating  gk^k  y^O  in  (4.2),  the  argument  y^fc(-)  is 
gi  i  ,,k  i  gi  gi 

partitioned  as  follows:  y  (■)  is  a  k  —  1  dimensional  vector  constracted  from  the  first  k  —  1  components  of  y  (■), 


g/1 


k\k  k\k 

while  the  scalar  3/^/2 ( ’ )  is  the  last  component  of  ygi  (-)•  Based  on  the  results  in  subsection  3.1,  at  k  =  1  the 

pi  k  I  k 

numerators  in  (4.2)  are  one.  Alternatively,  the  form  of  (4.2)  can  be  maintained  also  for  «  =  1  by  initializing  g  ■,  (-) 
«  n9l°(^)=  1- 

Now  we  will  perform  one  time  propagation  and  one  measurement  update  and  show  that  the  above  form  is 
maintained  at  any  time  step.  Although  the  derivations  are  similar  to  those  presented  in  the  previous  section,  a  full 
derivation  is  presented  here  for  clarity. 

4.1.  Time  Propagation  from  k  to  k  + 1.  Starting  with  the  characteristic  function  (PXk\Yk  v  of  (4.1)  we 
want  to  compute  \  yk  v  while  accounting  for  the  time  propagation  equation  Xk+ 1  =  0  Xk  +  T  Wk-  The  result 
in  (C.4)  of  Appendix  C  indicates  that  the  desired  characteristic  function  is  given  by 

(4.4)  cp  v  =  cp  Ory  cp  VTv  =  ntlkgk\k  yk^k(0Tv)  exp  yk^k(Orv)  e~^(r’v^ 

Xk+ 1  \  y k  Xk\Yk  l /V  j  gj 

ei  i=  1 

k/k 

_  nt  Qk\k  vk\k  W 

i  g/(Ory)  exp  yei{®Tv)-p\(r,v)\  . 

i=  1 

k\  k 

The  coefficient  functions  gf.  (■)  are  given  in  (4.2).  Their  arguments  and  those  of  the  exponents  in  (4.4)  can  be 
redefined  as  follows 

k/k 

yk+1  \k 

gi  \~i~Ygi 


(4.5a) 


k\k  "ei  k\k  k\k 

(v)  =  y  ■  (Ory)=  Vi  sgn  foa,  ,v)  eRk, 


=1 


W+1  |  k  k\k  nei  k\k  k\k  k\k 

ei  (P)  =  ye/  {®Tv)-p\(r,v)\  =  -  Pi  I  (CD  a,  ,v)\-p\(r,v)\+j(bi  ,v). 


(4.5b) 

With  these  definitions,  (4.4)  can  be  restated  as 


=1 


(4.6) 


cp  v 

Xk+1  I  Yk 


k\k  Jt+1  \k,  ,  Jc+1|#r,„,. 

g  y  (v)  exp  y  (v)  , 

i  gi  ei 


i=  1 


which  clearly  has  a  similar  form  as  (4.1).  Specifically,  defining  the  time  propagated  parameters  similarly  to  the 

Jc+1  \k  J<\k  i^k+ 1  \k  L~k\k 

five-steps  procedure  outlined  in  section  3.2  including  the  new  parameters  r  =  r  .  ,  h  =  h  ,  the  result 
above  can  be  restated  as 


(4.7) 


<P 


Xw  |  zk 


V  - 


9 


,k+ 1 1  k 


/=  1 


/+1 

gi 


|  k  k+1\k 

(V)  exp  y  (1/)  , 
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where 


(4.8) 


and 


9 


A+ 1 1  k 


/+ 

gi 


1 1  k 


( v )  = 


!  -  ^isr  c  |v> + hT lk  «5r  -  h- 


7c+1  |/c 


2rr 


A+1  |A - A+1  |A - A  +  1  |  A 


JCi 


+  d,  +y9i 2  (vo  v'c, 


A+1 1  k 


A  +  1  |  k - ArtfA - 

d,  +ygi 2  O') 


nfc*i/fc  /fc 

(4.9)  /+1  1 V)  =  "  qk+ 1  "sgn  fa'"1 ",  W  e  R\  /+1  "(v)  =  -  “  p^1 "  |  ( ak+ 1 ",  yj  |  +  j(bk+' ",  yj. 

gi  i  i  ei  i  /  / 

=  1  =1 

The  vector  y"+1  "(■)  is  partitioned  into  two  parts  used  to  evaluate  (4.8):  >""(■)  is  constructed  from  the  first  /c  —  1 
31  a  a 

components  of  y^"(-),  while  the  scalar  yg;2(-)  is  the  last  component  of  yg/  ^ 

The  number  of  terms  that  define  (Px^  jZk  ■  in  (4.7)  is  identical  to  that  of  (pXk\zk  ■  in  ^1T  However,  the 
number  of  elements  needed  to  define  the  arguments  of  those  terms  has  increased  by  one,  i.e.,  n  =  n  +  1.  It 

■  |(4-  "j  1 1^  _  “|  I  ©/  ©/ 

may  happen  that  for  a  given  /,  several  vectors  a  with  £  c  [1, . . . ,  n  ]  in  (4.9)  could  be  co-aligned.  In  such 

/  e# 

cases,  the  associated  elements  in  the  sums  of  (4.9)  could  be  combined,  thus  reducing  the  number  of  elements  in 
those  sums,  as  discussed  in  Remark  3.1. 

4.2.  Measurement  Update  at  k+\.  After  performing  the  general  time  propagation  step  in  the  previous 

subsection,  thus  attaining  the  characteristic  function.  <Pxk+^  \zk  '  °f  (4.7),  the  next  step  is  to  use  the  next  mea¬ 
surement  update  za+i  =  Hxa+i  +  i/a+i  to  determine  cp  ■  ,  where  yA+i  =  zi  z2  ■  ■  ■  za+i  .  Using  the 

XfcM  |  Yk+i 

general  convolution  results  of  (A.7)  in  Appendix  A, 


(4.10) 


■XfcMiYfcM  v  =2ff  v-HTq  <pv  -n  y'z^ndn 

—  oo 
CO 

1 


2tt 


XfcM  |  Yk 


v-  HTn  exp  ~Y\n\  +  jZk+Ml  dr /, 


where  from  (4.7) 
(4.11) 


r  n?Vk 

cp  v  -  HTrj 


XfcM  |  Yk 


9 


,A+1 1 A 


gA+1|A  y<+1|A  v_HTn  exp  yk+Mk  v_HTn  ' 
/  gi  ei 


i=  1 


/  (■)  is  given  in  (4.8),  and 

/+1"  v-HTn  =  n*Vk 


(4.12a) 


,A+1  |  A  A+1  |  A  T 


gi 


i  sgn  (a,  ,v-H  q)  e  R  , 


(4.12b) 


/+1"  v-HTq  =  -  n 


fcH  /fc 


A+1  |A  A+1  |  A 


A+1  |  A 


\(a,  ’V~H  n)\+j(b,  ,v-h  q). 


=i 


The  integral  in  (4.10)  is  solved  using  the  methodology  presented  in  Appendix  B.2  in  precisely  the  same  way 
that  (3.31)  was  solved  in  section  3.3  while  replacing  2 1 1  by  k  +  1  \k  and  2|2by/c+l|/c+l.  The  equivalent  forms  to 

(3.47),  (3.48)  and  (3.49)  are 


(4.13) 


k+i/fc+1 

(p  V  f  nk+ 1  |  A+1  lyA+1  |  A+1 


A+1 | A+1 


|  Yfc+1 


i=  1 


9 .  /+ 

gi  (v)  exp  yei  (1/)  , 


(4.14) 


9 


,A+1  | A+ 1  yA+1  | A+1  1  c  9U+1/fc+i  Xg/1 


m  ^1|/C+1M  +  /7A+1|A+1  gA|A  i/fc+i  yA+1|A+1((/)_rtA.+  1|A+1 


gi 


(v) 


Tf+rTTf+T 

A+1  |  A+1  'j,  -/+1IA+1  M/^  jn  A+1  |  A+1  w  1  j-.A+IIA+l  Dj 


_  -  .  ...  ,  JC+  K+  ,,K+  K+  /.,\  ;_/<+  /(+  J  ,,K+ 

27T  in^\k+ 1  +c//  +y9/2  O')  Jc  j  -d.  +ygi2 


O') 
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(4.15a) 

II 

S' 

+ 

(4.15b) 

W(+1  |  Ar+1 
ei  (V)  =  ~ 

„fc+i/fc+i 

% 
=1 

fc+1 lk+1 


,k+ 1  \k+<\ 


sgn 


k+ 1  |/c+1 


fc+1 


ra,-  ’  e  R  - 


n 


k+'llk+l  k+'llk+'l  k+'llk+'l 

Pi  I  (a,  ,  v)  |  +j(bj  ,v). 


=i 


-  k+'\  \k+'\  n  k+'\\k+'\ 

As  discussed  earlier  in  this  work,  if  for  a  given  index  /  several  vectors  a  ,  £  c  [1 , . . . ,  n  ]  are  co- aligned, 

/  e/ 

the  associated  elements  in  the  sums  of  (4.15)  could  be  combined,  consequently  reducing  nej  ,  the  number  of 
elements  in  those  sums. 

The  above  results  clearly  show  that  the  form  of  the  characteristic  function  proposed  in  (4. 1-4.3)  for  the  time 
step  k  is  also  maintained  at  k  +  1 . 


5.  Conditional  Mean  and  Estimation  Error  Variance.  The  minimum  conditional  variance  estimator 
of  Xk  given  the  measurement  sequence  yk  =  zi  Z2  ■  ■  ■  Zk  is  the  conditional  mean  of  Xk  given  yk .  It  can  be 
determined  by  evaluating  the  characteristic  function  of  (4. 1 )  and  its  derivatives  at  v  =  {0}n,  or  as  v  —>  {0 }n.  In  this 

section  we  show  that  <^fc|  Yk  1  is  twice  continuously  differentiable  and  give  explicit  expressions  for  the  conditional 
mean  and  the  estimation  error  variance. 


5.1.  Continuity  of  the  First  Two  Derivatives  of  the  Characteristic  Function.  The  continuity  of  the 
first  two  derivatives  of  (Pxk\Yk  ■  is  proven  by  induction.  First  the  case  that  hi  /=  0,  /  =  {\,  . . . ,  n}  is  addressed. 
The  case  when  hi  =  0  for  some  /  c  {\,  . . . ,  n}  is  addressed  at  the  end  of  this  subsection. 

5.1.1.  Continuity  of  the  First  Two  Derivatives  of  <Px1\y1  .  Assuming  that  hj/=  0  for  /  =  {l,  n},  the 

characteristic  function  for  the  ucpdf  of  initial  state  given  the  first  measurement  is  given  by  the  convolution  integral 
in  (3.10).  Then,  using  the  definitions  in  (2.4)  and  (3.11),  (3.10)  is  rewritten  as 


(5.1) 


i/  —  n 

I  Xl\Zi  2 TT  e~a^-^e-Y\n\+izMidn 


/=  i 


where  v,  =  (fJi,  v)  and  v=  v\  ■  ■■  vn  7  ■  The  integrand  of  (5.1)  is  a  continuous  function  of  r?  and  tfV  /  =  1 , ,n. 
Moreover,  its  partial  derivative  with  respect  to  any  Cf,  given  by 


5  (n" 

(5.2)  e-Pi\vi-n\ e-Y\n\+jzin 

OVi 

i=  1 


n 


=  — p7sgn(tf-/7)e  pJ^  n\  e  p,|5;  ple 


-Y\n\+jzin 


ft 


is  piecewise  continuous,  bounded,  and  integrable.  Hence,  when  computing  the  partial  derivative  of  <pXi  |Zi  v  with 
respect  to  Cf,  the  differentiation  and  integration  operations  can  be  reversed  [2]  to  yield 


(5.3) 


d<  Xi  \z, 

dvj 


1  00  ri7 

_  p  sgn(\/  —  q)e~Pj^vJ~n\  e~Pi\vi~n\ e-Y\n\+iz^ndr\ 

2  TT  J  J 

—  oo  y  — 


Since  the  integrand  in  (5.3)  is  piecewise  continuous  and  bounded  functions  of  r)  and  tfV /  =  1 , . . . ,  n,  the  first 
derivativeS^i^  i/  /dv  is  continuous  [2], 

The  secondpartial  derivatives  of  ^  |Zl  v  is  attained  by  differentiating  (5.3).  To  avoid  the  differentiation  of  the 
piecewise  continuous  function  sgn  (yj  -  n),  we  introduce  the  change  of  variables  Vj-r]  =  o  =>  r)  =  i j  —  o,  dr)  =  —da 
to  restate  (5.3)  as 


(5.4) 


S<  xpz1  y 
Bvj 


1 

2tt 


n 


pysgn (or) [  e ~Pi I %-Yr+° I  e -y I  y-o I  +jz i (y-a) do 


i=  1 


The  integrand  in  (5.4)  is  a  piecewise  continuous  in  cr,  continuous  in  v,  V  /  =  1 , . . . ,  n,  and  its  first  partial  derivative  with 
respect  to  v,  is  piecewise  continuous,  bounded,  and  integrable.  Consequently,  in  computing  the  partial  derivative 
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of  (5.4)  with  respect  to  v ,  the  order  of  differentiation  and  integration  can  be  interchanged  as 

co  C  C 

d2(p 


(5.5) 


1 


5  nn  n 

lzi  v _  P/Sgn(CT)e_ftlCTl -  e-Pi\Vi-Vj+a\ e-Y\Vj-a\+jz:(Vj-a)-' cj0_ 

dvdvj  277  dv 


The  partial  derivative  and  hence  the  integrand  in  (5.5)  is  a  piecewise  continuous  and  bounded  function  of  the 
variables  cr  and  £f  V  /  =  \, ...  ,n.  Therefore,  the  integrals  exist  for  all  j  and  £  and  are  continuous  with  respect  to  all 

(vj,  v)  [2],  This  implies  that  all  the  elements  of  the  Hessian  of  <pXl  \z,  v  are  continuous  for  all  v. 

_  In  summary,  |Zl  v  is  twice  continuously  differentiable  with  respect  to  v.  Since  v  is  a  linear  function  of  v, 
i  Xi  I Zi  v  is  twice  continuously  differentiable  with  respect  to  v. 

5.1.2.  Continuity  of  the  First  Two  Derivatives  of  <Px2\y2-  The  time  propagated  characteristic  function 
at  k  =  2,  which  was  given  in  (3.22),  is  considered  first.  It  is  rewritten  as 


(5.6) 


1  x2\z,  V  =  <  Xl  \Zi 


(D Tv  e-&rTvL 


Since,  as  it  was  shown  above,  (DXi  | Zi  v  is  twice  continuously  differentiable,  <pXi  |Zi  ®r\/  is  also  twice  continuously 
differentiable  for  any  transition  matrix  ® .  Clearly,  <Px2\Zl  y  is  continuous,  being  a  product  of  two  continuous  func¬ 
tions.  However,  since  the  first  derivative  of  e~HrTv\  is  not  continuous  at  v  =  {0}n,  the  associated  pdf  fy2\z,  X2|zi 
does  not  have  any  moments.  This  implies  that  we  cannot  compute  a  priori  estimates  of  the  state  X2  given  only  the 
past  measurement  ^ . 

The  characteristic  function  at  the  second  measurement  update,  given  the  measurement  history  y2  =  {z- 1,  Z2}, 
was  expressed  in  (3.31)  and  rewritten  here  using  the  explicit  form  of  (5.6)  as 


(5.7) 


OO 

<  x2|y2  v  =2t t  1  x,\z,(y -HTrDe-WTv-Hrn\e-v\n\+iz2ndn, 

—  OO 


where  we  used  the  notation  (pXi  |Zi  (V)  =  <  x ■,  \ zn  (I>7''/  •  Note  that  if  HT  =  0,  the  term  e  would  come  out  of 

the  integral  in  (5.7).  Consequently,  in  this  case  <Px2\z2  v  would  not  be  continuously  differentiable  with  respect  to 
v,  and  there  would  be  no  minimum  variance  estimate  of  Xk  given  yk  for  all  k  >  2.  Therefore,  HT  /=  0  is  a  necessary 
condition  for  the  continuous  differentiability  of  (pXl  \  y2  1/  and  the  existence  of  the  desired  estimate  of  the  state  Xk 
at  any  time  step  k. 

Horn  section  5.  1-1.  Vx^z.iv  -  HTn )  is  twice  continuously  differentiable  with  respect  to  v.  Moreover, 
e-/3|r  v~Hrn\  js  once  piecewise  continuously  differentiable  with  respect  to  v  as 


(5.8) 


de-i3\rrv-Hrn\ 

_ =  -psgn 

dv 


FTv-HTrj  e  ^|r  H-Hnjipr 


Therefore,  when  constructing  the  first  derivative  of  (pX2 1  Yl  v  with  respect  to  v,  the  order  of  differentiation  and 
integration  in  can  reversed  to  yield 


1  di  1  zSv  R-p\rT  v-HVn\e-y\n\+jz2n dn 

2rr  dv 

—  OO 

]  OO 

+  -fiT  «  X,  I  z,(v-HTn)  -/3sgn  Ttv  -  HT  q  e“/3|r  e-Y\n\*J^ndn. 

—  OO 

Since  the  integrands  of  (5.9)  are  piecewise  continuous  and  bounded  functions  of  /?  and  vh  i  =  l,...,  n, 

3<  x2|y2  v  is  continuous. 

The  second  partial  derivative  of  1  Yl  v  can  be  taken  after  making  the  following  change  of  variables 

VTv  VTv 


(5.9)  ot  *2\y2  '/  _ 

dv 


HT 


HT 


(5.10) 


-  n  =  o  =*  n  = 


—  o,  drj  =  -do. 
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Substituting  (5.10)  into  (5.9)  we  obtain 


(5.11)  di  x2\y2  v  _ 
dv 


1  <9<~Xl|  Zl(Pv  +  H' 


2rr 


o') 

oo3v 


l-P\HVo\R-Y  HT 


do 


.  1  ~  ( Pv  +  Hto )  /3sgn(Hr  a)e 

%TT  IZi 


-jSIHfal  p  T  e~Y_ 


-a  +jz  2 


-cr  dO, 


where  P  =  1  -  HT(H  F )  1  T  r.  The  second  partial  derivatives  of  <pXl  |  y2  v  with  respect  to^y  are  obtained  by  differ¬ 
entiating  (5.11).  This  requires  taking  the  differentialjwith  respect  to  v  of  e  v  Tn~  °  +JZ2  ,  which  is  piecewise 

continuously  differentiable  and  bounded,  and  of  dcpXl  |Zl  (Pv  +  HTo)/dv,  which  is  continuously  differentiable  by 
the  result  of  the  previous  subsection.  This  implies  that  the  integrands  in  (5.1 1)  are  piecewise  continuous  functions 
of  cr,  are  continuous  in  v ,  and  have  piecewise  continuous  and  bounded  first  partial  derivatives  with  respect  to  v. 
Consequently,  this  implies  that  the  Hessian  of  <■ Px2  \y2  v  is  continuous  [2], 

5.1.3.  Continuity  of  the  First  Two  Derivatives  of  <Pxk+1  \  vfc+1  •  In  a  recursive  manner,  the  continuity  of 
the  first  two  derivatives  of  the  characteristic  function  is  now  shown  to  be  maintained  when  propagating  from  time 
step  k  to  time  step  k+  1,  as  it  was  maintained  in  going  from  times  step  1  to  2  in  subsection  5.1.2.  It  is  assumed 
that  using  the  recursion  up  to  stage  k ,  <PXk\Yk  v  as  given  in  (4.1)  is  twice  continuously  differentiable. 

The  time  propagated  characteristic  function  from  k  to  k  +  1 ,  given  in  (4.4),  is  written  as 


(5.12) 


*  XfcM  |  Yk  V  -  <  Xk\Yk 


<S>tv  e  PFTv\. 


Since  e-/3|r3>l  is  not  continuously  differentiable,  <PXk+1  \Yk  v  is  not  continuously  differentiable  and  minimum  vari¬ 
ance  estimates  cannot  be  determined  before  the  next  measurement  is  processed. 

The  measurement  updated  characteristic  function  at  k+\  is  given  in  (4.10)  and  rewritten  here  using  the  explicit 
form  of  (5.12)  as 


(5.13) 


1  X**i  I  Yk+i  V 


CO  ~ 

X  (  (v  -  ht rt)e~^V7v~Hrn]e~^+‘Zk*'  i  dq, 

2  TT  Xk\Zk 

—  oo 


where (PXk^Yk(v)  =  <  Xk\Yk  ®7"'/  •  The  ^rst  derivative  of  <PXm  |Yfc+1  v  is  obtained  in  a  manner  similar  to  that  used 
to  construct  (5.11).  To  take  the  second  derivative,  the  change  of  variables  given  in  (5.10)  is  used  to  obtain 


15  14)  d*  Xk+i  I  yWi  v  _ 
dv 


1  d(  Xk\YK 

Tn 


(P\/  _i_  /_/ 7”  rT v  rT v 

- p — /h  HT a  |  p  ~  V~ht  a  +fzfc+'i  — 


w 

dv 


do 


_  1 


2tt  VkkiYk 


(, Pv  +  Hto )  /Bsgn(HF  o)e~/3IHral  F  7  e  Y 


-cr  +jzfc*1 


-cr  dO. 


The  integrands  in  (5.14)  are  at  least  piecewise  continuous  functions  of  o  and  continuous  in  v.  Moreover,  their 
first  partial  derivatives  with  respect  to  v  are  piecewise  continuous  and  bounded.  Therefore,  (px^i  \  ym  v  is  twice 
continuously  differentiable  [2],  thereby  justifying  the  supposition  that  the  recursion  produces  a  twice  differentiable 
characteristic  function  after  each  measurement. 

5.1.4.  Measurement  Matrix  H  with  Zero  Elements.  Suppose  H  has  only  one  zero  element,  hj  =  0,  and 

assume  that  (H,  O)  is  an  observable  pair.  Therefore,  (5.1)  can  be  written  as 


(5.15) 


cp  v 

X ,  IZi 


-  tiaj\(ej,v)\ 

2tt 


i-cti\vi-hiri\e-y\n\+jziridri  — 


\(ej'v)  I  _ 


i=1 

'/=j 


2tt 


cp  (e,v), 

X !  \Z-]  (  j  ) 


where  i  x^\z^((q,  v))  is  determined  by  the  integral  in  (5.15)  and  q  is  a  nx  n  —  1  matrix  defined  as  q  = 
{e\, . . . ,  e/- 1,  ey+i, . . . ,  en}.  Due  to  our  previous  result  in  subsection  5.1.1,  *  x,  \z:  {(q,  v))  and  thus  cp< ,  iz-,  v 


16 


M.  IDAN  AND  J.  L.  SPEYER 


is  twice  continuously  differentiable  with  respect  to  the  (q,  v)  =  {Vi ^  v  ,...,vnj.T.  However,  clearly 

1  XiiZi  v  is  not  continuously  differentiable  with  respect  to  (ej,  v)  =  vj  due  to  the  term  e_0t'l('ewWI  jn  (5.15). 
Therefore,  only  a  reduced-order  state  estimate  can  be  constructed  at  k-  1. 

Using  (5.15),  the  time  propagated  characteristic  function  at  k  =  2  is  given  by 

(5.16)  (p  V  =  (ff  (pTv)e-aj\eJ<i>iv\e-li\(T,V)\ 

<P 

x2\z1  xnz,  j 

After  the  second  measurement  update  it  is  expressed  as 

,  OO 

(5.17)  q)  v  =_  cp  ®ri/  -  HOe  r\  e~aj\ejP  tv-h<x>5  n)\e-P\rHv-HT n)\e-v\n\+jz2ndn 

X2W2  2tt  Xllzi  J 

—  OO 

If  HOej  =  0,  as  in  (5.15),  (5.17)  decomposes  as 

ctj  eT  Tv\ 

(5-18)  <x2|Y2  v  _  e~  2^. - 1  x2\Y2^j<3>Tv), 

where  <Px  ^(^(D7!/)  is  defined  by  the  remaining  integral  in  (5.17)  after  removing  e~°j'eJ P  U  from  it.  Clearly, 

in  this  case  <pJ  v  is  not  continuously  differentiable  at  v  =  {0}n.  However,  if  HOey  /=  0,  there  are  no  terms 
x2 1  r2 

that  can  be  factored  out  like  in  (5.18).  Moreover,  since  the  integrand  in  (5.17)  is  continuous  in  n  and  v,  and  has 

piecewise  continuous  and  bounded  derivatives  with  respect  to  those  variables,  using  the  procedure  established  in 

subsection5.1 .2,  %2\Y2  ^  is  twice  continuously  differentiable.  Consequently,  due  to  the  result  in  subsection  5.1.3, 

when  H  O  ey/=  0,  (3  2  v  is  twice  continuously  differentiable  for  any  k  >  2. 

Xk\Yk 

Now  we  return  to  the  case  that  HOey  =  0.  Using  the  steps  presented  above,  if  H02ey/=  0,  then  (&k\Yk  v 
is  twice  continuously  differentiable  for  any  k  >  3.  Similarly,  if  also  A702ey  =  0  but  A703ey  /=  0,  continuity 
can  be  proved  for  k  >  4.  Since  the  system  is  assumed  to  be  observable,  there  always  exist  1  <  £  <  n  such 
that 

/-/Op_1ey  =  0  Vp  <  £  and  HO  _1ey/=0.  Consequently,  (pk\yk  v  is  twice  continuously  differentiable  for  any  k  >  £. 
This  procedure  can  be  extended  to  any  number  of  zero  elements  of  H. 

5.2.  Construction  of  the  Conditional  Mean  and  Estimation  Error  Variance.  Having  established  in  the 
previous  section  that  cp>(k \Yk  v  is  twice  continuously  differentiable,  the  explicit  form  of  the  pdf  of  the  measurement 
history,  the  conditional  mean,  and  conditional  variance  is  determined  from  (4. 1-4.3)  by  evaluating  (pXk \Yk  v  and 
its  derivative  as  v  —>  {0}n.  We  choose  v  -  v,  where  is  a  positive  scalar  such  that  ^  0  and  \x is  a  fixed  direction 
in  the  v  domain  for  which 

(5.19)  (a^k,v)  0,  V  (/,£). 

A 


Along  this  direction  the  conditional  mean  and  conditional  variance  will  be  shown  to  be  easily  computable,  avoiding 
the  discontinuity  issues  of  sgn  ( a  ,  v)  when  the  condition  in  (5.19)  does  not  hold.  Due  to  the  continuity  of 

(  Xk |  Yk  v  and  its  first  two  derivatives,  any  v  can  be  chosen  as  long  as  (5.19)  holds.  With  this  choice  of  v, 

(5.20)  sgn  (ak^k,  v)  =  sgn  (ak^k,  v)  =  s' 

/  / 

are  well  defined  constants  that  satisfy  |s' |  =  1.  from  (4. 1-4.3),  cp  ■  along  this  direction  is  given  by 

Xfc|  Yk 


(5.21) 


cp  v 

Xfcl  Yk 


/ 


/=  1 


exp  V)  , 

ei 


k\k 

where  the  functions  g;  (■)  are  given  in  (4.2).  Their  arguments,  defined  in  (4.3),  are 


(5.22) 


/I* 

gi  (  ^  = 


k\k 

Qi  sgn 


k\k 

(a  ,  ,  v)  = 


k/k 


k\k  j  k\k 

Qi  s  =  ygi  (to 


e  R 


=1  =1 

^  /c  I  /f  ..  I 

Since  the  vector  \x is  constant,  the  arguments  y  (■)  and  thus  the  coefficient  functions  g  (■)  are  constant  along 

gi  < 

the  chosen  direction.  This  will  simplify  greatly  the  evaluation  of  (PXklYk  ■  and  its  derivatives  at  the  origin  of  the 
spectral  variable  v. 
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(5.23) 


Similarly,  the  arguments  y^fU)  of  the  exponents  in  (5.21),  defined  in  (4.3),  are  manipulated  as  follows 


nk'k  nklk 

et  k\k  k\k  k\k  el  k\k  k\k  k\k  k\k 

(v)  =  -  Pi  |  fa  ,  v)\  +j(b;  ,  v)  =  -  Pi  sgn  fa  ,  v)  ( a ,•  ,  i '/)+j(b;  ,  v) 

=i  =i 

L  L 


=  D-  P 


k\ ksi(ak]k,  v)  +  j(bklk,  v)0=  (fk(v),  v), 


where  we  have  defined 
(5.24) 


ei  i  i  i 

=  1 


k\k  k\k 

which  is  a  n-dimensional  vector.  Since  the  vector  v  is  constant,  so  is  the  expression  (yei  (v),  v),  making  yei  (■)  of 
the  exponents  linear  in  along  the  chosen  direction.  Using  the  above  results,  (5.21)  can  be  restated  as 


(5.25) 


<P  v  =nt  gklk  yklk(V)  exp  (?'*(&),  v) 

Xk\Yk  i  gi  ei 

/=  1 


This  last  form  of  cPXk\Yk  v  now  be  used  to  compute  the  minimum  variance  estimate  and  its  error  variance. 


(5.26) 


The  conditional  mean  of  the  state  Xk  given  the  data  sequence  yk  is  given  by 

J _ kfi  ~Xk\Yk  v  t 


xk  =  E[xk\yk]  = 


JfYk  Yk 


d(  V) 


E= 0 


The  pdf  fYk  yk  needed  to  normalize  the  above  result  is  determined  by  evaluating  <PXfc|yfc  v  at  =0.  Using 
(5.25)  we  obtain  that 


Mk 


(5.27) 


V.  y*  =  lk,r„  V  E_o 


C  C®  ■ 


—  /=  1 


k\k  k\k  ~ 

where  g  (■)  are  given  in  (4.2)  and  the  arguments  y  (y)  were  defined  in  (5.22). 

/  _  gi 

The  derivative  of  9U,  lw  ■  used  in  (5.26)  is  determined  by  differentiating  (5.25).  Since,  as  stated  earlier, 

gk\ k  yk\k  _  k\k  _fc| 1  rk 

i  gi  (K>  and  (yei  (V),  v)  are  constant,  the  derivative  is  evaluated  as 


V 


(5.28) 


5(  5) 


gK'K  F|KWexp  crKCv),  v) 

i  gi  ei  ei 


/=  1 


Evaluating  (5.28)  at  =  0  yields  the  minimum  conditional  variance  estimate 

1  fc/fc 

_ 1 _ nt  k  k  k  k  k  k 


(5.29) 


Xk=  9/1  yJ  (y)  yel  (^ 

y/<  /=1  9 


where  fyy  y/<  ,  gi  (-),  yg;  (C),  and  y,,-  (y)  are  given  in  (5.27),  (4.2),  (5.22)  and  (5.24),  respectively. 
The  second  conditional  moment  of  xw  given  yk  is  determined  by 

1  d2(p  v 

(5.30)  E[xkxT\yk\  =  _  xk\Yk 

k  j2  fYk  yk  d{V)d(V)T  E=0‘ 

The  second  derivative  in  the  above  equation  is  computed  by  differentiating  (5.28)  to  yield 

T 


(5.31) 


d2,XklYk  V  =nk'kgk\k  yUk^  -k\k(S/)  -k\k(9)  exp 

d(  V)d(  V)1  '  9'  Yei  Yei  (  ei  ) 

i=  1 
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Hence  the  second  moment  is  given  by 


(5.32) 


E[xkxT\yk]  = 

K  -2 


1 


Mk 


gk\k  /\\V) 

f  a  »  ,,i 9'  ”  y"  M 


Finally,  from  (5.29)  and  (5.32),  the  error  Xk  =  Xk  —  Xk  variance  can  be  evaluated  as 
(5.33)  E[Xk~xT  | yk]  =  E[xkxT \yk]  -  xkxT . 

k  k  k 


5.3.  Conditional  Mean  and  Estimation  Error  Variance  at  k  =  1.  To  appreciate  the  complex  results 
given  in  section  5.2  and  to  understand  the  underlying  structure  of  the  Cauchy  estimator,  greatly  simplified  and 
compact  expressions  are  determined  for  k  =  1 .  Here  we  address  the  case  that  h,  /=  0  for  /  =  {l,  ■  ■  ■ ,  n}.  The 
characteristic  function  (f^k\Zk  v  evaluated  along  v  =  v  is  given  in  (5.25).  For  k  —  1  it  can  be  easily  simplified  to 
(5.34) 


□  □ 


□  □ 


(p  v  = 

X1 1  z, 


"+1  -n  0+1  □  □[«. 

CDyz  +p  +  ps  ~  JZ 


r.  3  2  i 

2  TT  /=  1 


=  1 
/=/ 


/□  □  1 


n+ 1 

P  +  P  s 
/ 

=  1 
/=/ 


lP*in 

n  □ 


□ 


n+ 1 


exp 


p  (ar  i/js  +yz  p'  F  . 


-□ 


/□  □ 


□ 


=i 

/=/ 


The  pdf  of  the  measurement  history,  the  conditional  mean,  and  the  conditional  variance  are  found  by  evaluating 
(5.34)  and  its  first  two  derivatives  at  =  0.  There  are  many  straight  forward  manipulations  needed  to  obtain  our 
final  compact  results  presented  here.  These  detailed  derivations  are  omitted  here  for  brevity. 

First,  fyk  yk  k=i  -  fz,  z i  ,  given  in  (5.27),  is  determined  by  letting  0  in  (5.34)  and  noting  that  most  of 
the  term  in  the  remaining  sum  cancel  leaving  all  but  the  first  and  last  terms.  The  final  form  is  thus  determined  as 


(5.35) 


□ 

_  1  □ 


HI  \-l( 


n+ 1 


\-1D 


). 


fz-t  z  i  =  -L  -  jz i  +pi 
2  TT 


□  _  1 


a  \h  \+y 


TT 


=2 


z?  +  or  \h 

=i 


2  ■ 


In  the  last  equality,  the  initial  parameters  in  (2.4)  and  (3.11)  are  used: 

The  conditional  mean  is  computed  by  taking  the  derivative  of  <Px^\z^  ^  given  by  (5.34)  and  then  letting 

0.  We  obtain 
(5.36) 


a<*iizi  v 

crv 


E=  0 


□  □□ 

□  -i 

□ 

n+1 

□ 

n+1 

n+1 

n+1 

1 

□ 

jz 

-P  + 

□  □ 

2TT 

|UL  yzi  +P/  + 

P  S/D  - 

DJ  1 

/  P  S/  □ 

□  □ 

/ 

i=  1 

=1 

=  1 

=  1 

/=/ 

/=/ 

/=/ 

DDn 

T  ■  ,□  u 

p  s,a,  +iZip(T 


where  a  /  =  —a/  was  used  in  (5.36).  To  simplify  the  above,  place  numerator  terms  over  the  common  denominator 
elements  as  given  in  the  sum  of  (5.36).  In  doing  so,  after  a  bit  of  straight  forward  algebra  and  using  the  decomposition 
a,  =  p,  —  p  ,  these  elements  simplify  such  that  all  but  the  first  and  last  terms  cancel.  Therefore,  we  obtain 


(5.37) 


d  v 


E=  0 


.  X 

yzi  p  pr 
fi 

■  ^Tt+T 

l’,P 


Dividing  (5.37)  by  (5.35)  and  the  imaginary  number  j,  the  conditional  mean  is  determined  as 


l 

z  i  P  P 


(5.38) 


xi  = 


orsgn(A7)  ■■■  a  sgn(/7  |T 
=  zi _ L_ 


n+1 


2;  p 


r. 


=i 


a  |  hj  | 


This  is  the  explicit  form  of  the  conditional  mean  given  in  (5.29)  evaluated  and  simplified  at  k  =  1 . 
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The  conditional  second  moment  is  determined  from  the  second  partial  derivative  of  <pXi  |Zi  v  with  respect  to 
v  and  letting  ^  0  as 


(5.39)  3  <  IZi  v 


=1 


3  vd  V 1 


□  □ 

□  _i  □ 

□  -1 

□ 

n+ED  ■ 

n+ 1  □  □  • 

n+ 1  □ 

□ 

F 

+  p  +  p  s  -  JZ 

-P  +  P  s 

□  □  i 

i  i  □  □  1 

i  /□ 

□ 

i=i 

=  1 

=  1 

/=/  □ 

/='  □  □ 

□ 


□ 


n+ 1 


□  □ 


n+ 1 


T  TO 

□  -  p  s,a  i  +  jz-ipi  □  -  p  s,a  +yzi /v,  □ 

=i  =i 

/=/ 

/=/ 


To  simplify  (5.39).  place  numerator  terms  over  the  common  denominator  elements  as  given  explicitly  in  the  sum 
in  (5.39).  In  doing  so,  after  a  bit  of  straight  forward  algebra,  having  made  use  of  the  decomposition  a;  =  pi  —  p  , 
these  elements  simplify  greatly.  The  result  is 

(n-1  (V  ( 


(5.40) 


<9  « 


77 


rXi  |Zt 


-  =1*’  p 


3  i/3  i/r 


P  P  P  PT 

( - 2=1 - 

'n+1  ^ 


P  P  P 


E=  0 


^2+  ), 


=  1 


iP 


The  conditional  second  moment  is  given  by 
(5.41) 


,  .  1  3  v 

f  r ,  1  _ 

E  xix(|  |zi  =  -  - 

Zi(Zl)  p  p  -7  ^=0 

3  v3  v' 


where  fz!  (zi )  is  found  in  (5.35).  The  conditional  error  variance,  where  xi  =  xi  —  xi ,  is  then  determined  as 


f  _  I 

(5.42)  E  xixr  |zi  =  1  +  \ 


7? 


(  /=1  or,  |  /?/ 1  +  K) 


...  X 

a  \h  \  +y 

i  i 

/=2 


-oficrnsgn(hi)sgn  (hn) 


□ 


-cr  a  nsgn  (h  }  sgn  (h  J 

(n- 1  : 


CTn 


\ 

□ 


l^l  i=1 


a/ 1  ft;  I  +  K 


This  is  an  explicit  form  of  the  conditional  variance  given  in  (5.33)  for  k  =  1 . 

It  is  surprising  that  at  k  =  1  the  pdf  of  the  measurement  history,  the  minimum  variance  estimate  and  the 
estimation  error  covariance  matrix  given,  respectively,  in  (5.35),  (5.38),  and  (5.42)  can  be  expressed  analytically  in 
such  a  compact  and  relatively  simple  form  as  functions  of  the  system  parameters  and  the  first  measurement.  Such 
simplicity  is  not  retained  for  the  subsequent  time  steps.  It  is  also  interesting  to  note  that,  contrary  to  the  Gaussian 
case  where  the  estimation  error  variance  is  known  a  priori,  the  estimation  error  variance  of  (5.42)  is  an  explicit 
(quadratic)  function  of  the  measurement  zi  . 

6.  Three-Dimensional  State  Estimator.  The  performance  of  the  proposed  estimator  was  tested  numeri- 


cally.  The  system  parameters  were  chosen  as  follows: 

□  □ 

□  □ 

□  □  □  □ 

1.4  -0.6  -1.0 

0.1 

cn  0.1 

0  =  4  -0.2  1.0  0.5  ^ 

II 

□ 

o 

□ 

□  a2  L  =  D  0.08  4 , 

f  0.6  -0.6  -0.2  j 

H  =  1.0  0.5  0.2  , 

-0.2 

(73  0.05 

13  =  0.1, 

-'N 

II 

O 

fo 

The  system  has  stable  eigenvalues  at  0.7  ±  0.3 j  and  0.8.  It  is  observable  and  complies  with  the  necessary  condition 
that  HT  /=  0. 

The  simulation  results  are  depicted  in  Fig.  6.1,  showing  the  estimation  errors  in  6.1(a),  and  the  cross-correlation, 
process  and  measurement  noises  in  6.1(b).  It  can  be  clearly  observed  that  the  estimation  error  standard  deviation 
depends  on  the  measurement  sequence.  Specifically,  the  standard  deviation  increases  when  a  large  measurement 
noise  is  encountered,  e.g.,  at  time  step  #4. 
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1  i - 1 - r - 1 - 1 - 1 - t 


12  3  4  5  6  7  8 


1  I - 1 - r - 1 - 1 - 1 - T 


1  2  3  4  5  6  7  8 


1  2  3  4  5  6  7  8 


Step# 


Step# 


(a)  Estimation  Errors 


(b)  Cross-correlations  and  system  noises 


Fig.  6.1.  Three-State  Estimation  Example:  Numerical  Results 


7.  Conclusions.  For  a  linear  discrete-time  system  with  additive  Cauchy  measurement  and  process  noises,  an 
analytic  recursion  of  the  characteristic  function  of  the  unnormalized  condition  pdf  is  determined  for  the  n-vector 
state  estimator.  This  characteristic  function  is  shown  to  be  a  growing  sum  of  terms  involving  a  coefficient  function 
multiplied  by  an  exponential  whose  arguments  are  weighted  nonlinear  functions  of  the  spectral  variable.  It  is 
then  shown  that  this  characteristic  function  is  twice  continuously  differentiable,  allowing  the  determination  of  the 
conditional  mean  and  the  conditional  second  moment,  from  which  the  conditional  error  variance  is  determined. 
These  moments  are  a  nonlinear  function  of  the  measurement  history,  where  the  conditional  mean  constitutes  the 
Cauchy  estimator  for  the  multivariable  system.  The  estimator  was  then  evaluated  numerically  for  a  third-order 
example.  At  this  stage  of  the  numerical  development,  only  a  few  steps  in  time  are  shown  for  this  example  because 
of  the  excessively  large  number  of  terms  needed  to  represent  the  characteristic  function.  Nonetheless,  this  is  the 
first  time  that  a  closed-form  analytical  estimator  is  derived  for  a  multivariable  system  with  heavy  tailed  Cauchy 
process  and  measurement  noises. 
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Appendix  A.  Characteristic  Function  of  an  Unnormalized  Conditional  PDF  Given  a  Scalar  Mea¬ 
surement.  Consider  a  scalar  measurement  of  an  /7-dimensional  state  vector  as 


(A.l) 


z  =  Hx  +  v  =  hixi  +  v. 

i=  1 

where  z  is  a  given  scalar  measurement,  x  e  Rn,  and  v  is  Cauchy  distributed.  For  this  measurement,  the  characteristic 
function  of  the  unnormalized  conditional  density  function  is  given  as 

OO  OO 

(A.2)  <  xiz  v  =  ■"  fx  x  fv  z~Hx  eivT*dx. 


-OO  — OO 


The  above  resembles  a  Fourier  transform  of  a  product  of  two  functions:  fx  x  and  fv  z  —  Hx  .  Using  the  dual 
convolution  property,  this  integral  can  be  solved  by  a  convolution  in  the  v  domain  between  the  characteristic 
function  of  fx  x  ,  i.e.,  cpx  v  ,  and  the  characteristic  function  of  fv  z  —  Hx  ,  which  we  denote  by  cpv  v  .  Flence, 
i  X|Z  v  can  be  computed  by  the  convolution  integral 


OO  OO 


(A.3) 


x\z  V  = 


1 


(2  rr)n 


q>x  v-  n  w  n  dn- 


The  characteristic  function  </V  v  is  determined  as  follows 

OO  OO 

(A.4)  cpv  v  =  ...  fv  z-Hx  eiv  1  Xdx  = 


OO  OO 


<  „  \ 

fv  z-  hx  e  ’,“|,"'cfai 


-OO  — OO 


-OO  — OO 


/=  1 


).n 


Assuming,  without  loss  of  generality,  that  hn  /=  0,  we  perform  the  change  of  variables  <f  =  z  —  hiXi.  This 

implies  xn  —  z  —  f,  —  ’"=1  hjX:  /hn  and  for  fixed  x;,  /  =  1, —  1,  dxn  =  d£/|  hn  \ .  Using  this  change  of 
variables,  (A.4)  is  manipulated  as 

(A.5)  cpv{v)=  ...  fv(OeJ'  w  1  ^  d^/\  hn\dx-\  ■  ■  ■  dx„-i 


ei' 


00  —  □  □ 
fnZ/hn  21  .  □  □ 

-  fv(£)e-J^dt 

\h„\  3 


OO  OO 


-OO  — OO 


□ 

□ 


n- 1  V  —  “  V  X 
i=  1  *  hn  n 


‘dx 1  ■  ■  ■  dxn 


QjVnZ/hn  V  .  .  n  1 

- W  -  w 

I/Inl  h 


n — 1 


"n  □  /  ^  ^  dXV  =  «p„  -  Vn  1  1  (277)5  "vi-hiVn  , 


'n  /=  1 


- ^  h 

\hn\  hr 


/=  1 


where  5(-)  is  the  Dirac  delta  function.  Substitute  (A.5)  into  (A.3)  to  obtain 


n_1 


OO  OO 


(  n— 1  ( 

a n 


(A.6) 
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(V)  = 


(2  if) 
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O 
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277 1 /7n 


<px 


<px{v  -  n)^ hnz %  n  m  -  -  qn  dty  ■  ■  ■  drin-\drin 

11  n  ;=1  5  nn 


I  lT  HH  In  (  n 

v  -  H  _/,  z„  fin  dqn- 


hn  e/nnz(pv  _■ 


hn 


Finally,  using  the  change  of  variables  /)  =  /fn/fu  and  hence  dr)n  -  dq\hn\,  the  last  equation  can  be  restated  as 


(A.7) 


x\z^-jff  Vx  v~HTn  <Pv  (-/7) eizn drj. 


Appendix  B.  Integral  of  an  Exponent  of  Absolute  Values.  The  measurement  update  stage  of  the 
Cauchy  estimator  entails  evaluating  a  convolution  integral  that  involves  an  exponential  of  absolute  values  of  affine 
functions  in  the  spectral  variables.  In  this  appendix  we  present  a  methodology  for  carrying  out  such  integrals. 
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B.l.  Exponent-Only  Integral.  In  the  first  scalar  measurement  update,  e.g.,  (3.8),  we  encountered  the  basic 
integral  of  the  form 

f(  \  I 

v  n 

(B.l)  /  =  exp  -  p\s-n\  +J'zr>  dn, 

—  oo  =1 


where  z  is  the  measurement,  p  -s  are  positive  constants,  and  the  £  -s  are  variables  linear  in  i/.  The  difficulty  in 
evaluating  this  integral  is  that  the  £  -s  must  be  ordered  to  carry  out  the  integration.  However,  since  the  £  -s  are 
functions  of  the  spectral  variables  t/-s  which  take  on  any  value,  any  ordering  of  £  -s  is  possible.  We  show  that  the 
solution  to  this  integral  can  be  expressed  in  closed  form  regardless  of  the  order.  For  convenience,  define 


(B.2)  £o  =  -oo,  ft+1  =  oo. 

To  carry  out  the  integral,  we  first  assume  that  <f  <  ft  for  all  £  <  /  with  (£,  /)  e  { 0 , . . .  ,n  +  \}.  In  this  case  the 
integral  (B.l)  can  be  decomposed  into  the  sum 
(B.3)  □ 


\ 
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/  = 


/=  o 


exp 


p  if  -n\  +Jzn  dn 


6m 


exp 


n  X  .  I  ^ 

p  (f  -n) sgn(?  -n)  +izn  dn  ■ 

=i  § 

ft  '~°  f; 

Note  that  sgn(£  -  /?)  is  constant  over  the  interval  /?  e  (ft,  ft+i),  V/.  This  constant  depends  on  the  relative  size  of 
the  indices  £  and  /.  With  this  in  mind,  for  /?  in  the  closure  n  e  [ft,  ft+i],  we  can  define  the  sign  function  in  (B.3)  as 


(B.4) 


sgn(£  -ft)  if  //=£ 
sgn(£  -  n)  =  s/  =  -i  if  /  =  £ 


From  (B.4),  for  a  given  /,  the  discrete,  two  indexed  function  s  yis  constant  for  all  £  except  for  one  switch  at  £  =  /, 
i.e.,  s  =  -1  V  £  <  /  and  s  =  1  V  £  >  /'.  Moreover,  it  can  be  concluded  that  s  =  s  V  //=£. 

/  /  /  /'—i 

Using  the  dgfinitions  in  (B.4),  (B.3)  can  be  restated  in  terms  of  s,  and  integrated  as 
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The  first  sum  in  (B.5)  can  be  manipulated  as  follows 

□  (  V  □  □  (  )  □ 
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~  1  and  s.  =  s.  V  //=  £  the  above  result  can  be  simplified  as 
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In  the  above,  the  zero  term  p/(f/  -  f/)s'  =  0  was  dropped  from  the  sum  in  the  exponent. 

The  second  sum  in  (B.5)  can  be  manipulated  in  a  similar  fashion  to  yield 

□  (  Y, 

n  n  exp  -  p  (f  -  f;)s  +yz£,  □ 
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Substituting  (B.7),  (B.8),  and  sy=  sgn(£  -  <f/)  V  £/=  /  of  (B.4)  into  (B.5),  the  solution  to  (B.l)  is  expressed  as 

□  □  !□  □  □ 
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where 
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Changing  the  ordering  of  the  <f  -s  caused  by  changes  in  the  v-s  does  not  change  the  solution  of  the  integral 
given  by  (B.10).  Effectively,  the  sgn  function  in  the  exponential  and  denominator  make  the  integral  independent 
of  the  ordering  of  the  <f  -s  and  thus  giving  a  general  result  for  the  integral  of  (B.l). 


B.2.  Generalized  Integral.  In  the  second  and  subsequent  measurement  update  steps  we  encounter  a  more 
general  integral  of  the  form 


<  n  \  n  n 
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where  Q  could  be  m-dimensional  vectors.  We  use  the  same  methodology  as  before  to  solve  the  integral  in  (B.l  1). 
Using  the  definitions  of  fo  and  £n+ 1  in  (B.2),  and  those  of  sy  in  (B.4),  the  integral  is  restated  as 
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It  is  integrated  as 
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and  manipulated  as  before  to  yield 
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Substituting  s  ,•=  sgn(£  -  £;)  V  £/=  /,  the  integral  becomes 
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Appendix  C.  Time  Propagation  of  Characteristic  Functions  for  Linear  Systems.  In  this  appendix 
we  consider  a  multi-stage  propagation  of  a  random  vector  through  a  linear  system,  i.e.  Xk+ 1  =  Ox*t  T  where 
x/c  e  Rn  is  the  system  state,  the  initial  conditions  xi  e  Rn  and  process  noise  Wk  e  Rm  are  independent  random 
vectors  with  given  pdfs  fx1  xi  and  fw  w  ,  and  characteristic  functions  cpx \  Vx  and  cpw  vw  ,  respectively.  Note 
that  here  Vx  e  Rn,  while  \/w  e  Rm.  For  simplicity,  the  pdf  and  hence  the  characteristic  function  of  w  are  assumed 

to  be  constant  for  all  k.  0  e  Rnxn  and  T  e  R,7Xm  are  known  matrices,  with  |0|  £  0. 

Assume  that  at  step  k  the  state  pdf  and  characteristic  function  are  given  by  fxk  Xk  and  (pxk  vx  .  The  pdf  of 
Xk+ 1  is  determined  by  using  the  linear  transformation 


(C.l) 
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in  the  joint  density  fxkw  Xk,  Wk  =  fxk  Xk  fw  Wk  and  finding  the  marginal  density  [11]  as 
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The  characteristic  function  of  Xk+ 1  is  given  by 
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Interchanging  the  order  of  integration,  and  using  the  substitution  Xk+ 1  =  O Xk  +  Twk  =>  dxk+ 1  =  |®  Idxx,  the 
integral  in  (C.3)  is  solved  as 


: 

oo  oo 


□ 


(C.4)  <Px,,t1  v  =  -  fXk  Xk  e>vI(<PXk+rwk)dXkBfw  Wk  dwk 

—  OO  — oo 

n  □□  □ 

D  CO 

=  -  fxk  Xk  ejv'cx>XkdXk  fw  Wk  eJvI  Vwkdwk  -  =  (pxk  0Tv  cpw  r7v 


—  oo 


—  oo 


B  Appendix 


Mo  she  Idan  and  Jason  L.  Speyer 

Multivariate  Cauchy  Estimator  with  Scalar  Measurement  and  Process  Noises 
Precedings  of  the  2013  IEEE  Conference  on  Decision  and  Control 


47 


Multivariate  Cauchy  Estimator  with  Scalar  Measurement  and  Process  Noises 


Moshe  Idan 
Aerospace  Engineering 
Technion  -  Israel  Institute  of  Technology 
Haifa,  Israel,  32000 
Email:  moshe.idan@technion.ac.il 


Jason  L.  Speyer 

Mechanical  and  Aerospace  Engineering 
University  of  California,  Los  Angeles 
Los  Angeles,  California  90095-1597 
Email:  speyer@seas.ucla.edu 


Abstract — The  conditional  mean  estimator  for  a  n-state  linear 
system  with  additive  Cauchy  measurement  and  process  noises  is 
developed.  For  the  multi-variable  system  state,  the  characteristic 
function  of  the  unnormalized  conditional  probability  density 
function  is  sequentially  propagated  through  measurement  up¬ 
dates  and  dynamic  state  propagation,  while  expressing  the  result¬ 
ing  characteristic  function  in  a  closed  analytical  form.  Continuity 
of  this  characteristic  function  and  its  first  two  derivatives  at 
the  origin  of  the  spectral  variable  is  proven.  It  is  then  used  to 
determine  the  desired  conditional  mean  and  conditional  variance 
in  a  closed  analytical  form  to  yield  the  sequential  state  estimator. 
A  three-state  dynamic  system  example  demonstrates  numerically 
the  performance  of  the  Cauchy  estimator. 

1.  Introduction 

In  many  engineering,  economic,  telecommunications,  and 
science  applications  the  underlying  random  processes  or  noises 
have  significant  volatility,  which  is  not  captured  by  the  light¬ 
tailed  Gaussian  probability  density  functions  (pdf)  [1],  Heavy¬ 
tailed  distributions  have  been  shown  to  better  represent  these 
volatile  random  fluctuations.  Examples  are  radar  and  sonar 
sensor  noise  [2]  and  air  turbulent  environment  noise  [3].  Our 
objective  is  to  develop  a  filtering  technique  for  linear  dynamic 
systems  with  heavy-tailed  distributed  noises  while  using  a 
particular  distribution  out  of  the  class  of  symmetric  a-stable 
distributions  [4].  Particular  distributions  of  this  class  are  the 
Levy,  Gaussian  and  Cauchy  distributions. 

The  use  of  heavy  tail  (Sa-S)  distributions  was  demon¬ 
strated  to  yield  improved  filtering  and  detection  results  when 
processing  radar  signals  [5]— [7],  radar  glint  [8],  data  in  a 
multi-user  communication  networks  [9]  and  aircraft  naviga¬ 
tion  [10].  These  results  and  the  degraded  performance  of 
the  standard  Gaussian  estimators  when  exposed  to  impulsive 
noises  motivated  the  derivation  of  a  sequential  non-linear 
estimator  for  scalar  linear  dynamic  systems  with  additive 
Cauchy  process  and  measurement  noises  [11].  This  result  was 
based  on  propagating  the  conditional  pdf  (cpdf)  of  the  system 
state  given  the  measurement  history. 

Unfortunately,  the  recursion  scheme  for  generating  the  cpdf 
directly  for  a  scalar  linear  system  [11]  does  not  generalize 
to  the  vector  state.  In  [12]  a  similar  estimator  was  derived 
by  generating  the  characteristic  function  of  the  unnormalized 
cpdf  (ucpdf)  in  a  recursive  scheme.  This  approach  is  somewhat 
simpler  than  the  scheme  in  [11],  allows  for  a  stronger  result 
regarding  the  decay  of  the  estimator  parameters  with  time,  and 
can  be  generalized  to  the  multi-variable  case. 

The  generalization  of  the  characteristic  function  of  the  ucpdf 

approach  [12]  to  the  multi-variable  case  is  the  essence  of  this 


paper,  which  is  organized  as  follows.  The  estimation  problem 
for  a  D-dimensional,  discrete-time,  linear  system  forced  by 
scalar  Cauchy  process  noise  and  a  scalar  measurement  with 
additive  cauchy  measurement  noise  is  formulated  in  section 
II.  In  section  III,  the  characteristic  function  for  the  ucpdf 
of  the  system  state  conditioned  on  the  measured  history  is 
computed  sequentially  for  the  first  measurement  update  and 
a  time  propagation  step.  This  motivates  the  general  form  of 
the  characteristic  function  for  the  ucpdf  given  in  section  IV, 
which  is  shown  to  be  continuous  and  twice  differentiable  in 
section  V  and  yields  closed-form  analytical  expressions  for 
the  minimum  variance  estimate  of  the  states  and  the  estimation 
error  conditional  variance.  In  section  VI  a  three  state  system  is 
used  to  exemplify  the  performance  of  the  sequential  estimator. 
Concluding  remarks  are  given  in  section  VII. 

II.  Problem  Formulation 

We  consider  the  single-input-single-output  multivariable  lin¬ 
ear  system 

Xk+i  =  Qx/t  +  fwk,  zk  =  Hxk  +  vk,  (1) 

with  state  vector  Xk  G  R'7.  scalar  measurement  Zk,  and  known 
matrices  O  G  Rn*n,  T  G  Rnxl,  and  H  G  Rlx".  The 
noise  inputs  are  assumed  to  be  independent  Cauchy  distributed 
random  variables.  Specifically,  Wk  is  assumed  to  be  Cauchy 
distributed  with  a  zero  median  and  a  scaling  parameter  yS  >  0. 
Similarly,  Vk  has  a  Cauchy  pdf  with  a  median  of  zero  and  a 
scaling  parameter  y  >  0.  The  characteristic  functions  of  these 
scalar  noises  are  assumed  to  be  time  independent  and  given 

by 

<Pw  v  =  e~^v\  cpv  v  =  e“v'|l/l  (2) 

These  characteristic  functions  have  a  scalar  argument  V.  The 
initial  conditions  at  k  =  1  are  also  assumed  to  be  independent 
Cauchy  distributed  random  variables.  Specifically,  each  i-ih 
element  Xi  ;  ofthe  initial  state  vector  Xi has  a  C auchy  pdf  with 
a  zero  median  and  a  scaling  parameter  a,  >  0,  /  =  1, n. 
The  characteristic  function  of  the  joint  pdf  of  the  initial 

conditions,  which  is  a  function  of  a  D-dimensional  spectral 
variable  V  G  Rn ,  is  given  by 

„  /  n  \ 

n 

cpxi  v  =  e  a‘'v'\  =  exp  -  or/|v,] 

/=  i  i=  i 

[',  exp  -  n  p1  \(a\v)\  +j(b\v)  .  (3) 

/  /  1 

/=  1 


The  last  form  was  introduced  for  notational  convenience  to  be 
used  in  the  sequel.  We  used  the  definitions 

Pi  =  cti,  a,  =  e-h  i=l,...,n,  =  {0}n,  (4) 

where  e,  is  a  D-dimensional  /- th  unity  vector  and  {0 }n  is 
D-dimensional  vector  of  zeros.  With  this  formulation  of  the 
system,  our  goal  is  to  compute  the  minimum  variance  estimate 
of  Xk  given  the  measurement  measurement  history,  i.e.,  yk  = 

zi  ■  ■  ■  zk  . 

111.  Initial  Derivations 

The  method  proposed  to  solve  this  Cauchy  estimation  prob¬ 
lem  entails  propagating  the  characteristic  function  of  the  cpdf 
of  the  state  vector  given  a  history  of  measurements.  Evaluating 
this  characteristic  function  and  its  derivatives  at  the  origin  of 
the  spectral  vector  v  will  provide  the  desired  state  estimate  and 
its  error  variance.  The  characteristic  function  is  initiated  by 
the  expression  given  in  (3).  It  changes  during  a  measurement 
update  when  a  new  measurement  is  processed,  and  during  time 
propagation  affected  by  the  process  noise  input.  We  begin  by 
showing  how  this  characteristic  function  is  computed  for  the 

first  measurement  update  and  time  propagation  step.  This  will 
suggest  the  general  form  of  the  characteristic  function  and 

hence  the  general  estimator. 

A.  Measurement  Update  at  k  =  1 

The  characteristic  function  of  the  initial  state  Xi  conditioned 
on  the  initial  measurement  Zi  =  Hx i  +  Vi  is  given  by 
r 

(Px^z,  v  =  fxi\zi  xi|zi  eJvTx,dxi'  (5) 


where  cpv  ■  and  (px1  ■  were  defined  in  (2)  and  (3),  respec¬ 
tively.  Hence  (9)  can  be  restated  as 

|^>o  I  /  \ 


Qj(b\,v) 


jhrf.v)  -  Ha\r\ | 


~Y\n\  +j  zi  —  Hb{  n  dr 7.  (10) 

This  integral  is  solved  using  the  general  result  presented  in 
Appendix  B.  For  that,  the  coefficients  of  rj  in  the  absolute  value 
term  of  (10)  have  to  be  normalized  to  one.  Clearly,  this  cannot 

be  attained  if  some  of  these  coefficients  are  zero.  Hence,  we 
assume  that  Ha 1  /=  0,  /  =  1, . . . ,  n,  or  equivalently  that 
all  ' 

the  elements  of  H  are  nonzero.  With  this  assumption,  (10)  is 
restated  as 

'  Xl|Zl  v  =  Prr  x 

r  n+ 1 

x  exp  -  Pi\(iJi,v)  ~n\+Kin  dn,  (ii) 

— 00  i  1 

where  we  defined 

Pi  =  p)\  Ha)  \,  pi  =  a)/{Ha),  i=l,...,n, 

Pn+l  =  y,  Pn+1  =  {0 }n,  (l  =  Z\  ~  Hb\. 

Using  the  result  in  Appendix  B,  while  associating  and  z  in 
the  later  with  (fJi,  v)  and  (fi,  respectively,  the  integral  in  (11) 

is  evaluated  as 

n+1  (  ( 

1  x,\z,  v  =  g}d  wi\Hv)  exp  y^(v)  ,  (13) 


where  fXl|Zl  Xi|zi  =fv  zi  -  Hxi  fx,  Xi  /fz,  Zi  and  where  the  coefficient  functions  g)l\ ■)  are  given  by 


fZl  Zi  =  fv  Z 1  -  Hxi  fXl  Xi  dx  1.  (6) 

—  00 

The  subsequent  derivations  can  be  simplified  by  avoiding  the 
division  by  fz^  Z\  ,  thus  addressing  the  unnormalized  cpdf 
and  its  characteristic  function  defined  by 

f  xi|zi  =  fx  \z  *i\zi  fz  zi 

*i\Zi  ii1  1 

=  fv  z  1  —  Hx  1  fXl  xi  ,  (7) 


a 111 V1 


'  =  2tt 


J_  (  HI 

Kl  +  Pi  +  Ygi  (  V) 

In  ( 

-  j(i  -  pi  +  yllhv ) 


The  arguments  of  g,-  (■)  and  the  exponents  in  (13)  are 
y*'.V)  =  n+1  P  sgn  {(a;  ,  v) ) , 


I  X,|Z,  V  =  fv  Zi-Hxi  fx,x  1  e>v  X'dx  1.  (8) 

—  oo 

When  the  actual  (normalized)  functions  are  needed  for,  e.g., 
computing  the  state  estimates,  the  normalization  factor  can  be 
easily  determined  by  evaluating  <^>Xl  xi|zi  at  v  =  { 0}n , 

i.e.,  fz-t  zi  =  ^PXl| Zl  {Q }n  ■ 

Using  the  derivation  in  Appendix  A,  the  D  integrals  in  (8) 
can  be  expressed  as  the  single  integral 

r° 

(Xiiz,  v  =  2W  V*  v-HTt]  (pv  ~n  e/Zindn,  (9) 


p  \(a,  ,v)  \  +j(b,  ,v), 


where,  while  using  (12),  for  R/=  i  we  have  defined 

U  1  1 

u  — - //=  n  +  1,  R/=  n  +  1 


a,  =  rfa1  Ha 1  i=  n+1 

□  a 

n  R  =  n  + 1 


111  _D  zr-Hb-  - 
bi  ~  nbl  Ha> 

i 


a'  +  b1  i  /=  n+1 


i  =  n  +  1 


2 


To  simplify  the  notation  and  subsequent  derivations,  specif¬ 
ically  to  avoid  the  R  /=  /  exclusion  in  the  sums  of 

(15), 

the  elements  in  these  sums  are  renumbered  sequentially,  i.e., 
Re  [1 ,  n]  for  each  /-  th  term.  The  renumbered  vectors  a,  will 
be  denoted  by  a1.1.  To  accommodate  subsequent  derivations, 
the  coefficients  p  in  the  sums  of  (15)  will  be  marked  differ¬ 
ently  for  y1|1(-)  and  y1'1(-):  we  will  use  q1'1  for  the  former 


and  pj"'1  for  the  latter.  The  parameters  z  and  p;  in  (14)  will  be 

denoted  by  C/'1  and  dj '1,  respectively.  Finally,  the  number  of 

terms  in  the  sum  of  (13)  will  be  denoted  by  ni'1  =  n+ 1,  while 
the  number  of  elements  in  the  sums  of  (15;  will  be  marked 

by  a1'1  111 

J  ej  =  n.  Although  at  this  stage  all  the  counters  n ^  are 
the  same,  as  will  be  seen  in  the  sequel,  they  may  be  different 

ei  t 

and  n^1  seem  to  be  related  to  each  other  (the  former  is  n  +  1 
while  the  latter  are  n).  they  will  exhibit  irregular  changes  and 

thus  were  introduced  separately. 

With  these  notations  and  renumbered  parameters  and  vec¬ 
tors,  (Pxi\Zi  v  is  restated  as 


9' 


(g/1!1  v 


/=  l 


i|i  ^  i|i 

oiM  exp  ye,  (v) 


(17) 


where 


gm  yn 

i  a' 


l  ( 

_1  v  i|i  i|i  i|i  -i 

.  (V)  =  ipi  +di  +  y(v) 

1  [  1  2tt  (  I 

111  111  111  -1 

—  fa  dj  +  ygi  ( v)  (18) 


and 


the  ucpdf  of  X2  given  Zi  is  given  by 
1  X2IZ1  v  =  <  xqz!  <Pw  fTv 


(20) 


( 

g1'1  y* 


( 


111  T 


(19a) 


n W  ( 

vtfiM  =  O/'^gn  (a)ll,v)  , 

=1 

nlf1 

1|1  1|1  1|1  1|1 
e,  ('/)  =  -  Pi  I  fa,  ,v)\+j(bi  ,v).  (19b) 

=1 

It  is  interesting  to  note  that  the  initial  characteristic  function 
in  (3)  is  also  expressed  in  a  form  identical  to  that  given  in 
(17).  The  initial  parameters  are:  n1  =  1;  g1  =  1  (and  thus 

there  are  no  y11  arguments);  =  V);  the  parameters  p1.  and 
the  vectors  a1  %nd  b 1  are  givenein  (4). 

/ 

Remark  3.1:  The  special  cases  when  some  of  the  conditions 
Ha  /  /=  0,  /=  1, do  not  hold  can  be  handled 
similarly, 

to  yield  results  that  resemble  those  presented  above.  These 
cases  are  not  presented  here  for  brevity. 

B.  Time  Propagation  to  k  —  2 

Given  the  time  propagation  equation  X2  =  Oxi  +T vvi,  while 


(=1  7  gi(®  v)  exp  yei  (O  v)-fi\(yv)\  . 

The  last  expression  in  (20)  can  be  expressed  in  a  form  that 
is  similar  to  the  one  given  in  (17).  For  that,  we  make  the 
following  observations  that  lead  to  the  definitions  of  new  and 
time  propagated  parameters  and  vectors. 

1)  1  X2IZ1  v  is  expressed  with  the  sajj^e  nunjj^er  of  terms 

as  <£)  ._  v  .  Hence,  we  define  nt  =  nt  . 

Xi  |Zi 

2)  The  coefficient  functions  gf^1  ( ■),  or  specifically  the  jja- 

r^yetersy  and  d,  ,  rem^y  as  in  (18),  i.e.,  C,  =  C,  , 

dj  =  dj  ,  i  e  [1, . . .  ,nt  ].  There  are  only  changes 

in  the  arguments  of  gf1'  (■).  For  notational  consistency 
!  ?  1 1 
we  will  denote  the  updated  coefficient  functions  g  \  ( ■) 

while  remembering  that 

n2|i  ill 

in  /  (0  £?!»/  (‘)-  in,  (21) 

3)  The  parameters  q  and  p  used  to  define  y  ( J  and 

yi|i  ‘  1  2\P  1|1 

e j  (■)  in  (19)  are  also  unchanged,  i.e.,  p,-  =  p,  , 

2|i  i|i  2|i  i|i 

qi  =  di  ,  /€  [1,1(1 .,nt  ],  R  e  [l,...,nej  ]. 

4)  The  arguments  of  y  ( ■ )  and  y  ( ■ )  are  multiplied  by 

</' 


e'i|i  i|i 

0T ,  hence  affecting  the  vectors  a,-  and  b ,•  as 

t| Wt  .  nW  2|1  ^  2|1 

y  (0  v)  =  q,  sgn  (a,  ,v )  , 


(22a) 


,iji(Ori/)  =  -  pf\(af,  v)  |  +j(bf\  v),  (22b) 


\/e, 


where  we  used  the  definitions  a2'1  _  ^ _i|  1  u2\i  _ 
1|1  2|1  •  1|1  4,8  i  ’  °i 

Obj  ,  /  e  ],  Re  [l,...,n  •  ]. 

5)  The  exponents  are  a  function  of  an  additional  element 
— /3|(T,  v)\.  Hence,  the  number  of  elements  that  de¬ 
fine  the  new  argument  yej  (■)  increase  by  one,  i.e., 
tiei  =  nei1  +  1-  The  parameters  and  vectors  that 


define  these  new  elements  are  p2^(1  =  C,  a2'1,,.  =  T, 

in£  ini’? 

i  e  1, . . . ,  n2}1].  For  consistency,  and  to  facilitate  the 
subsequent  manipulations  of  the  characteristic  function, 
the  number  of  elements  in  the  sum  of  the  new  arguments 

y^jV)  are  also  increased  by  one,  while  introducing  the 
zero  parameters  q  =  0,  /  e  [1, . . . ,  nt  ].  Hence, 

irigi 

with  these  new  elements,  the  arguments  y2g](')  and 
■)  are  defined  as 


n^-1  ( 

1^1(1/)=  q2|1sgn  (a2ll,v) 

=  1 

ntii 

2|1  2|1  2|1  2|1 
using  the  r  esult  of  Appendix  C,  the 


(23a) 


characteristic  function  of 


ye,  ('/)  =  -  Pi  I  (a,  ,  v)  |  +j(bi  ,v).  (23b) 


and 


Using  all  the  time  propagated  and  newly  defined  parameters, 
(20)  is  restated  as 


ip  2\i  v  =  g2'1  2,1 

Yx2\ Zi  /  gi(  v)  exp  yej  (v)  ,  (24) 


where 

g2\l  (y2|l 

/  giM 


1  '(  2|1  2|1  2|1  -1 

=  —  jc;  +dj  +y  i( v) 

/  &  a 

n  —  jtf1  -  cff]1  +  yllHv)  _1  (25) 


with  arguments  y2'.1(-)  and  y2'.1(-)  given  in  (23).  Overall,  we 

Ql  ©/ 

have  obtained  a  form  which  is  similar  to  the  one  in  (17), 
determined  after  the  first  measurement  update  in  the  previous 
subsection. 


C.  Summary  of  Initial  Results 

The  above  derivations  demonstrate  that  the  characteristic 
function  of  the  ucpdf  of  the  state  Xk  at  time  steps  k  =  1  and 

2  is  expressed  as  a  sum  of  rit  weighted  exponential  terms. 
The  number  of  terms  increases  during  a  measurement  update 
and  are  unchanged  during  a  time  propagation  step.  In  each 
such  term  /’,  the  exponents  and  their  weights  or  coefficients 
are  functions  of  a  sum  of  nei  elements.  The  number  nei 

of  elements  normally  increases  during  the  time  propagation 
step  and  are  unchanged  during  a  measurement  update.  Those 

observations  provide  the  insight  and  guidance  on  how  to 
construct  the  characteristic  functions  at  any  time  step  k,  as 
is  discussed  next. 


IV.  Measurement  Update  and  Time  Propagation: 
General  Case 


The  initial  results  presented  in  the  previous  section  suggest 
the  general  form  of  the  characteristic  function  of  the  ucpdf  of 
the  state  at  any  time  step  k  given  a  measurement  history.  Con¬ 
sequently,  we  assume  that  at  any  time  step  the  characteristic 
function  of  the  ucpdf  of  the  state  Xk  given  all  the  data  history 
up  to  k,  i.e.,  yk  =  Zi  ■  ■  ■  Zk  ,  is  expressed  as 


qyk\k  v 
xk\  Yk 


/=  1 


9‘ 


k\k  k\k 

gi  M  exp  ye.  (V)  ,  (26) 


where 


9-'k  ykg]k(v) 


gk-i\k-i  {k\k,v)  +  hk\k 

=  1  □  f'k  91 

- TZjyk - k\k - k\k - 


JC 


t  di  +ygii(v) 
V  k\k 


k-l\k-l  'k\k 

_,*/*  o,l  “  hi 

J(\k - k\k - k\k- 


1C' 


-dj  +ygi2M 


j 

~L  ’ 


(27) 


nT  ( 

ykg\k(v)=  ^qfk sgn  (a^k,v)  e  Rk ,  (28a) 

nklk 

k\k  el  k\k  k\k  k\k 

ei  ('/)  =  -  Pi  I  (a/  ,v)\  +j(bi  ,v).  (28b) 

=i 

As  will  be  explained  below,  yfc|^(-)  and  qk\k  are  k  dimensional 

vectors.  When  evaluating  gk]k  yk^k  ( ■ )  in  (27),  the  argument 

yj  ( ■ )  is  partitioned  as  follows:  yg,-jt  ■ )  is  a  A  —  1  dimensional 
vector  constructed  from  its  first  k  —  1  components,  while 

the  scalar  y^U)  is  its  last  component.  Also,  the  source  of 
the  indexes5"^1  *  k\k 


i  and  offsets  h,  will  be  detailed  in  the 
measurement  update  step.  The  initialization  of  this  function 
at  k  =  1  is  performed  based  on  the  results  derived  in 
subsection  III-A.  Now  we  will  perform  one  time  propagation 
and  one  measurement  update  and  show  that  the  above  form  is 
maintained  at  any  time  step. 

A.  Time  Propagation  from  k  to  k  +  1 

The  characteristic  function  <PXfc+1  \Yk  v  °f  the  time  propa¬ 
gated  state  Xk+ 1  =  O Xk  +  VWk  is  determined  using  the  result 
in  (C.3)  of  Appendix  C  and  is  expressed  as 

'Xfc+i| Yk  v  =  T Xk\ Yk  ®Tv  <Pw  fTv  (29) 

nk!k  (  ( 

_  f  k\k  k\k  T  k\k  T 

/  g,  (<D  V)  exp  ye.  (O  v)-p\(r,v)\  . 

i=  1 

The  arguments  of  the  coefficient  functions  gk}k  (■)  above  and 
those  of  the  exponents  in  (29)  are  redefined  as 

( 

=  qfk sgn  (Oafk,v)  ,  (30a) 

=  i 

/+1|V)  =  -  nJl  pkV‘\(Qaklk,  v)\  -  p\(r,  v)\  +j(bk'k,  v). 

ei  i  /  / 

=  1 

(30b) 

With  these  definitions,  (29)  can  be  restated  as 

nk/k  (  ( 

nt  k\k  k+l\k  k+l\k 

v  =  g.  ygi  ( V )  exp  yej  ( v )  , 

/=  i 

(31) 

which  clearly  has  a  form  similar  to  (26).  Specifically,  defining 
the  time  propagated  parameters  similarly  to  the  five-steps  pro- 
cgc|ijr£  outbid  in  se^ip^  III-B  ^hiding  the  new  parameters 

r,-  '  =  r;-'  ,  hi  '  =  hj  the  result  above  can  be 

restated  as 

nk*MK  (  ( 

<P  v  =  k+i\k  yk+i\k(v)  exp  /+il k(v)  , 

Xk+ 1  \^-k  cJi  J  gi  J  ei 


/=  1 


(32) 


where 


measurement  updated  characteristic  function  is  expressed  as 


gk+i\k  yk+i\k 

i  <)'  ( 

k-l\k-l  \j<+l\k  k+l\k 

1  ^  d^Vk  g/l  ( v)  +  hi 

~  k+l\k  k+l\k  k+l\k 

1  ,  +  ft  *y,a  M  D 

fk+l/k  y 

- krv,-drv*+y%"*M® (33) 

and 


^  ei  /c+  1  \  k 

Jjtn\k{v)=  qk+i\k sgn  ( a /  ,  W  e  R/b  (34a) 

=i 

fc+1  Ik 

J<+1\ k  ”e'  fr+l|/c  fc+l|fc  fc+l|fc 

e/  ('/)  =  -  Pi  I  fa  ,v)\  +j(b;  ,v). 

=  1  (34b) 

The  number  of  terms  that  define  (0  „  .  _  1  in  (32)  is 

-*7c+1  l^/f 

identical  to  that  of  cp  -in  (26).  However,  the  number 
yCk\Zk 

of  elements  needed  to  define  the  arguments  of  those  terms  has 
increased  by  one,  i.e.,  nk+1^k  =  nkk  +  1. 

B.  Measurement  Update  at  k+  1 

The  measurement  Zk+ 1  =  HXk+i  +  Vk+i  is  processed  next 
to  determine  cpx k+1 1  w,  ■  ,  where  y/c+i  =  Zi  ■  ■  ■  Zk+i  . 
Using  the  general  result  in  Appendix  A, 

'xw|Yk+i  v  =  (35) 

r  ( 

1  -  v-HTn  exp  -Y\n\+Jz  n  dr], 

:Sf-  <Py  lv  k+l 

2tt  x^+1 ' Yk 

- oo 


where  <  x*+i  lz*  '  was  determined  in  (32).  This  integral  was 
solved  in  (B.2)  of  Appendix  B.  This  result  indicates  each 
term  in  the  sum  of  cp  -in  (32)  generates  nej  'k  +  1 

Xk* 1  I  Zk 

terms  in_<  xk*-\\  Yk+1  v  :  the  index  /-k+1^k+1  indicates  which 


term  of  <  xk*i  \zk  ■  generated  the  new  ones.  In  addition,  this 
parent  term  is  called  with  an  offset,  denoted  as  Q,  in  (B.2).  In 
the  final  result,  this  offset  will  be  denoted  as  /7^+1lfc+1  Finally 
we  note  that  the  coefficient  function  of  the  exponents  in  (B.2) 
have  two  input  arguments:  the  sum  in  the  numerators  (with  the 
offset)  and  the  sum  in  the  denominators.  Grouping  those  two 
arguments  into  one,  w^cjerjqte  the  inmiT)/|^tor  to  the  updated 
coefficient  function  g  (■)  as  y  (■).  Clearly,  each 

/  ei 

measurement  update  increases  the  dimension  of  this  argument, 
and  hence  the  deduction  in  (28)  that  yK(-)  and  q  ^  are  k 


gi  i 

dimensional  vectors.  Similarly,  the  updated  yH1  lfc+1(-)  and 
ji+Hk+i  willbe  of  dimension  k+  1. 

After  the  integration,  reordering  terms  and  defining  updated 
parameters  similar  to  the  procedure  outlined  in  III-A,  the 


/  Xk+i  ,V/f+i 

‘  I  V  = 


,k+'\lk+'\  , 

f  ( 

k+l\k+l  k+l\k+l 

y/  y 


gi 


(V) 


/=  1 


k+l\k+l  ,  , 

x  exp  yei  ( v ) 
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( 

:k+l\k+l  yt+l|lf+l 


M 


,k\k 


( 


(f+l|fc+l 


k+l\k+l 


lt  g/l  M+hi 

=  2rT  J  irk+  lb+1 - k+±\k+i - k+l\k+l - 

1  +d;  +  ygi2  M 


( 


k\k  k+l\k+l /^\  _  ^  k+l\k+l 

k+ 1 1  k+ 1  Vgii 


□ 


J+l\k+l 


JC 


— k+l\k+l 

d; 


ft+llfc+1 


and 


k+i\k+i 

%/  M  = 


=1 


(36) 


(37) 


+  Ygi  2  M 

/c+l|/c+l  ^  k+l\k+l 

q )  sgn  ( a ,  ,v)  »  (38a) 


tk+1  lk+-\ 

k+l\k+l  k+l\k+l 

Pi  I  (a,  ,v)\ 

+j(bk+i\k+i  v)  ■  (38b) 

The  above  results  clearly  show  that  the  form  of  the  character¬ 
istic  function  proposed  in  (26-28)  for  the  time  step  k  is  also 
maintained  at  k  +  1. 


V.  Conditional  Mean  and  Estimation  Error 
Variance 


The  minimum  conditional  variance  estimator  of  Xk  given 
the  measurement  sequence  yk  -  Z\  ■  ■  ■  Zk  is  the  conditional 
mean  of  Xk  given  yk .  It  can  be  determined  by  evaluating  the 
characteristic  function  of  (26)  and  its  derivatives  at  v  —  {0}n, 

or  as  v  — »  {0 }n-  In  this  section  we  show  that  (pxk\Yk  ’  is 
twice  continuously  differentiable  and  give  explicit  expressions 

for  the  conditional  mean  and  the  estimation  error-variance. 
The  continuity  of  the  first  two  derivatives  of  <p  -is 

proven  by  induction.  The  characteristic  function  cp  Xk'Yk  v  is 

Xi  |Zi 

given  by  the  convolution  integral  in  (11).  Using  the  definitions 


in  (4)  and  (12),  (11)  is  rewritten  as 


1  Xi|Z,  v 


n 


n  cn\vi-n\e-Y\n\+jzMiclq 
i=le~ 


(39) 


\T 

where  i 7/  [',  (pi,  v)  and  v  =  V\  ■  ■  ■  vn  _ .  The  integrand 
of  (39)  is  a  continuous  function  of  q  and  v/V/  =  1 , . . . ,  n. 


Moreover,  it  can  be  shown  that  its  first  and,  by  a  change  of 
variables,  its  second  order  partial  derivatives  with  respect  to 
any  Vj  are  piecewise  continuous  and  bounded.  Consequently, 
(Px1 1 z i  v  is  twice  continuously  differentiable  with  respect  to 

v  [13].  Since  V  is  a  linear  function  of  V ,  (fixqzi  v  is  twice 
continuously  differentiable  with  respect  to  v. 
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The  time  propagated  characteristic  function  at  k  —  2  of  (20) 
is  restated  as 

m  v  =  <p  G>tv  e_/3|rT'/|.  (40) 

TX2\  Zi  Xi\Zi 

Since,  as  it  was  shown  above,  <Px-i\Zi  v  is  twice  continuously 
differentiable,  <PXl\z1  is  also  twice  continuously  dif¬ 

ferentiable  for  any  transition  matrix  O.  Clearly,  (p  x2|Zi  v  is 
continuous,  being  a  product  of  two  continuous  functions.  How- 

IY^=sif0et  thfh^s^gftY§^f  ^f2'rx21  it  “a&fSPfifjfflMB 

any  moments.  This  implies  that  we  cannot  compute  a  priori 
estimates  of  the  state  x2  given  only  the  past  measurement  zi. 

The  characteristic  function  at  the  second  measurement  up¬ 
date  given  the  measurement  history  y2  =  {z\,  Z2},  while  using 

the  explicit  form  of  (40)  is  given  by 

•  x2|Y2  v  = 

r 

1  ~  (v  -  HT q)e~^v  Tv-HVn\e-Y\n\+izin (jq  ( 4 1 ) 

2ir 

—  00 

where  (p  ( v )  [’,  <j 0  ®Tv  .  Note  that  if  Hf  =  0, 

X2|Zi  Xi\Zi 

T 

the  term  e“^lr  4  would  come  out  of  the  integral  in  (41). 
Consequently,  in  this  case  <Px2\z2  v  would  not  be  contin¬ 
uously  differentiable  with  respect  to  V ,  and  there  would  be 
no  minimum  variance  estimate  of  Xk  given  yk  for  all  k  >  2. 

Therefore,  Hr  /=  0  is  a  necessary  condition  for  the 
continuous 

differentiability  of  <Px2|r2  v  and  the  existence  of  the  desired 

estimate  of  the  state  Xk  at  any  time  step  k. 

It  was  shown  earlier  that  <  x2\zAv  ~  HT  l)  is  twice 

continuously  differentiable  with  respect  to  v.  Moreover, 

e-p\vT v-HVq\  a  piecewise  continuous  and  bounded  first 

derivative  with  respect  to  v.  Hence,  assuming  Hr  /=  0,  the  in¬ 
tegrand  in  (4T)  cairbe  shown  to  have  piecewise  continuous  and 

bounded  first  and  second  order  partial  derivatives  with  respect 
to  any  v.  This  implies  that  (Px2\z2  v  is  twice  continuously 
differentiable  with  respect  to  V  [13]. 

In  a  recursive  manner,  the  continuity  of  the  first  two 
derivatives  of  the  characteristic  function  can  be  shown  to  be 
maintained  when  propagating  from  any  time  step  k  to  time 
step  k  +  1,  as  it  was  maintained  in  going  from  times  step  1  to 


well  defined  constants  that  satisfy  |s'|  =  1.  From  (26-28), 
1  Xk\Yk  '  along  this  direction  is  given  by 


ktk 


(  Xk\Yk 


v  = 


91' 


k\k 


/=  1 


91 


( 

exp  y 


XI  k. 


(42) 


k\k 

where  the  functions  a  ( ■)  are  given  in  (27).  Their  arguments, 
defined  in  (28),  are 


ygi  (  v)  =  s  =  ygi  (v).  (43) 

k\k  =1  k\k  i  k\k 

k\k 

Since  the  vector  v  is  constant,  the  arguments  ygj  ( ■ )  and  thus 
k\k/ 

the  coefficient  functions  g /  (■)  are  constant  along  the  chosen 
direction. 

Similarly,  the  arguments  •)  of  the  exponents  in  (42), 
defined  in  (28),  are  manipulated  as  follows 

nklk 

k\k  e'  k\k  k\k  k\k 

Yei  (v)  =  -  Pi  |  (a;  ,  v)  \  +j(bi  ,  v)  (44) 

=  1 

L  k/k 

ne,  k  k  k  k 

k  k  k  k 

=  □-  n  I  '  I  In  1 

=1  /  sfa,  ,v)+j(b'  ,v)  =  (yei(v),v), 


where  we  have  defined 


k/k 


yk'k(v)  =  — pWsfaW+jbW. 


(45) 


_  k\k  a 

Sj^e  v  is  constant,  so  is  the  expression  (yej  ( v ),  v),  making 

yei  ( ■ )  of  the  exponents  linear  in  along  the  chosen  direction. 
Using  the  above  results,  (42)  can  be  restated  as 

nk,k  (  ( 

n-  k\kK  k\k  k\k 

/  ygi(v )  exp  (yei(v),v)  ■  (46) 

i=  1  a 

The  conditional  mean  of  the  state  Xk  given  the  data  se¬ 
quence  yk  is  given  by  ,  x 

7  -  A 

l  dcp  v 

=  =  -  IfV-  -'47> 


(p  0  = 

Xk\Yk 


2  presented,  above.  Assuming  that  K  of  (26)  is  twice 

continuously  differentiable,  one  co lira 'use  the  above  arguments 

to  show  that  so  is  cp  „  i  \  Yk-^  v  . 

Having  establisheatllat  k(p  v  is  twice  continuously 

Xk  \  Yk 

differentiable,  the  explicit  form  of  the  pdf  of  the  measurement 
history,  the  conditional  mean,  and  conditional  variance  is 
determined  from  (26-28)  by  evaluating  <  Yk  v  and  its 
derivative  as  V  — »  {0 }n.  We  choose  V  =  V,  where  is  a 
positive  scalar  such  that  — »  0  and  v  is  any  fixed  direction 
in  the  v  domain  for  which  (ak)k,v)  0,  V  (/,  R).  This 

*  '  (  m  . 
condition  avoids  the  discontinuity  issues  of  sgn  (a  -  ,  v) 

when  evaluating  ^ xm yk  v,  and  its  (jerjy/qdves.  With;  this 
choice  of  v ,  sgn  ( sr"  ,  v)  =  sgn  ( a  ,  v)  [ ,  s'  are 


=0 

The  pdf  fyk  yk  needed  to  normalize  the  above  result  is 
determined  by  evaluating  (Pxk\Yk  ^  at  =  0-  Using  (46) 

n^,k  ^ 

fYk  yk  =  (Pxk\Yk  V  =0  =  9?k  C<*>  ■  (48) 

/=  1 

The  derivative  tf^ja  -is  determined  by  differentiating 
(46).  Since  g  y  xk\Yk 

gj( v)  and  (yej  [v),  v)  are  constant,  this 
derivative  can  be  easily  determined.  Evaluating  it  at  =0 

yields  the  minimum  conditional  variance  estimate 

k/k  ^ 

X*.-1  '  aT  yfw  yfw.  m 

ify. 


/=  1 
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The  second  conditional  moment  of  xk  given  yk  is  determined 
by 

1  d2cp  v 

E[xkxT\yk]=  _  KkiYk _  .  (50) 

k  j2fYkYk  d(  v)d(  v)T  =Q 

The  second  derivative  in  the  above  equation  is  easily  computed 
by  twice  differentiating  (46)  to  yield 


E[xkxT\yk]  = 


(51) 


nk/k 


9 


Mk 


( 


j  fvk  yk  /=1 


yk\k 


k\k 


yj(v) 


Finally,  from  (49)  and  (51),  the  variance  of  the  estimation 

error  pk  =  xk  ~T xk  can  be  evaluated  as  E[xkxTi}yk]  = 
xkx  k\yk  xkx  ^ 


VI.  Three-Dimensional  State  Estimator 


The  performance  of  the  proposed  estimator  was  tested 
numerically.  The  system  parameters  were  chosen  as  follows: 
□  □  □  □  □  □ 

1.4  -0.6  -1.0  0-1  □  □  0.1 

0=  --0.2  1.0  0.5  -,r=  0.3d,  :c?2c  =  ^0.08- , 


0.6  -0.6  -0.2 


-0.2 


H=  1.0  0.5  0.2,  /3  =  0.1, 


03  0-05 

y=0.2. 


The  system  has  stable  eigenvalues  at  0.7  ±  0.3j  and  0.8.  It 
is  observable  and  complies  with  the  necessary  condition  that 
Hf  /=  0. 

The  simulation  results  are  depicted  in  Fig.  1,  showing  the 
estimation  errors  in  1(a),  and  the  cross-correlation,  process 
and  measurement  noises  in  1(b).  It  can  be  clearly  observed 
that  the  estimation  error  standard  deviation  depends  on  the 
measurement  sequence.  Specifically,  the  standard  deviation 
increases  when  a  large  measurement  noise  is  encountered,  e.g., 
at  time  step  #4. 


VII.  Conclusions 

For  a  linear  discrete-time  system  with  additive  Cauchy 
measurement  and  process  noises,  an  analytic  recursion  of 
the  characteristic  function  of  the  unnormalized  condition  pdf 
was  determined  for  the  /1-vector  state  estimator.  It  was  then 
shown  that  this  characteristic  function  is  twice  continuously 
differentiable,  allowing  the  determination  of  the  conditional 
mean  and  the  conditional  second  moment,  from  which  the 
conditional  error  variance  is  determined.  This  conditional 
mean  constitutes  the  Cauchy  estimator  for  the  multivariable 
system.  The  estimator  was  then  evaluated  numerically  for  a 
third-order  example. 
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Appendix 

A.  Characteristic  Function  of  an  Unnormalized  Conditional 
PDF  Given  a  Scalar  Measurement 

For  the  linear  measurement  equation  z  =  Hx  +  v  = 
f=1  hiXj  +  v ,  the  characteristic  function  of  the  unnormalized 

conditional  density  function  of  the  state  is  given  as 

oo 

*  T 

I  x\z  v  =  fx  X  fv  z  -  Hx  eiv  Xdx.  (A.l) 

—  OO 

The  above  resembles  a  Fourier  transform  of  a  product  of  two 
functions:  fx  x  and  fv  z  —  Hx  .  Using  the  dual  convolu¬ 
tion  property,  this  integral  can  be  solved  by  a  convolution  in 
the  v  domain  between  the  characteristic  function  (px  v  of 
fx  x  and  the  characteristic  function  of  fv  Z  —  Hx  ,  which 
we  denote  by  (pv  v  ,  i.e. 

r 

1  x\z  v  =  (px  v -a  (pv  °  da.  (A.2) 

—  OO 

The  characteristic  function  (pv  v  is  determined  as  follows 

r  /  n  X 

n 

(pv  V  =  fv  z  -  hiXi  e>  1=1  v,x'dx.  (A.  3) 

— oo  i  1 

straight  forward  to  show  that 


where  Q  could  be  m-dimensional  vectors.  Assuming  that  the 
constants  f  are  ordered  such  that  f  <  f  +1,  the  integral 
in  (B.l)  can  be  solved  over  the  intervals  f  ^  r?  <  f  +1  in 

which  the  sgn(-)  functions  are  constants  equal  to  ±1  and  the 
absolute  value  can  be  removed  from  the  exponents.  Summing 
those  integrals  yields  the  closed  form  result 

□  □  □  □ 

n  □□  n  J  J 

1=  exp nc-  p  -f/l'D+yzf/a 

/=  1  =i 

□  □,=/  □ 

n 

DQ;  +  Qsgn  (£-$)□ 

/=/_ 

XgyZ  +  P,+  *(?-$> 

□ 


□ 

1 

/ 

□  □ 

n 

□ 

gJ-Qi  + 

Q  sgn(f  - 

§)J  □ 

- 

=  1 
l=j 

□ 

n 

n  ■  (B.2) 

JZ  -  Pi  + 

p  sgn  (f 

-5)  □ 

!=} 

□ 

C.  Time  Propagation  of  Characteristic  Functions  for  Linear 
Systems 

Assume  that  at  step  k  the  state  pdf  and  characteristic 
function  are  given  by  f  and  (p  ,  where  Vx  e  Rn. 

Xk  Xk  Xk  Vx 

The  goal  is  to  determined  the  characteristic  function  of  the 
propagated  state  at  time  step  k  +  1,  given  by  Xk+i  =  Oxk  + 
T Wk.  It  is  assumed  that  Xk  and  the  process  noise  Wk  e  Rm 
are  independent  random  vectors.  The  pdf  and  characteristic 
function  of  the  latter  are  fw  w  and  (pw  vw  ,  respectively, 
where  vw  e  Rm.  O  e  Rnxn  and  V  e  Rnxm  are  known 
matrices,  with  |0|  0.  Due  to  the  linearity  of  the  state 

dynamics,  the  pdf  of  X^+i  is  given  as  [14] 

fxk+ 1  xk+  i  = 

r  '  fxk  (O  Xk+ 1  O  f Wk  fw  Wk  dWk ■  (C.l) 
(p-l  -1  -1 

—  oo 


i  x\ zM  ~  Tfr  f>x  v-HTq  (pv  ^  q)dzndq.  (A.4) 

—  OO 

B.  Integra l  of  an  Exponent  of  Absolute  Values 

The  measurement  update  stage  of  the  Cauchy  estimator 
entails  evaluating  a  convolution  integral  of  the  following  form 

I--  /  \ 

1  n 

1  -  9  =1  Q  Sgn(f  -  n) 

—  °o  - 

1/  \  I 

n 

x  exP  -  p  -r) |  +jzq  dr],  (B.l) 


Its  characteristic  function  is  given  by 

po  po 

<pxk+1  =  O-1  fx\-iXk+1  -  0~1^Wk 

— oo  — oo 

T 

x  fw  Wk  dwkeiv  Xk+Idxk+1.  (C.2) 

Interchanging  the  order  of  integration,  and  using  the  substitu¬ 
tion  Xk+  i  =  tVXk  +  T Wk  =>  dxk+i  =  |  O  |  dxk,  the  integral  in 

(C.2)  is  solved  as 

<Pxk+i  v  =  (pxk  ®Tv  q>w  rTv  .  (C.3) 
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Abstract — An  efficient  recursive  state  estimator  is  developed  for  two- 
state  linear  systems  driven  by  Cauchy  distributed  process  and  measure¬ 
ment  noises.  For  a  general  vector-state  system,  the  estimator  is  based 
on  recursively  propagating  the  characteristic  function  of  the  conditional 
probability  density  function  (cpdf),  where  the  number  of  terms  in 
the  sum  that  expresses  this  characteristic  function  grows  with  each 
measurement  update.  Both  the  conditional  mean  and  the  conditional 
error  variance  are  functions  of  the  measurement  history.  The  proposed 
two-state  estimator  reduces  substantially  the  number  of  terms  needed  to 
express  the  characteristic  function  of  the  cpdf  by  taking  advantage  of 
relationships  not  yet  developed  in  the  general  vector-state  case.  Further, 
by  using  a  fixed  sliding  window  of  the  most  recent  measurements,  the 
improved  efficiency  of  the  proposed  two  state  estimator  allows  an  accurate 
approximation  for  real-time  computation.  In  this  way,  the  computational 
complexity  of  each  measurement  update  eventually  becomes  constant, 
and  an  indefinite  number  of  measurements  can  be  processed.  The 
performance  of  the  Cauchy  estimator  was  demonstrated  numerically. 


I.  Introduction 

Dynamic  processes  involving  uncertainty  are  frequently  encoun¬ 
tered  in  fields  ranging  from  engineering  and  science  to  economics  and 
finance.  It  is  often  assumed  that  the  uncertainties  are  described  by  the 
Gaussian  probability  distribution,  mainly  because  modem  methods 
and  algorithms  are  able  to  handle  such  systems  very  efficiently  [1], 
However,  in  many  applications  the  underlying  random  processes 
have  an  impulsive  character  producing  deviations  of  high  amplitude 
and  small  duration  much  more  often  than  the  Gaussian  assumption 
permits  [2],  Examples  of  such  processes  include  radar  and  sonar  noise 
[3]  and  disturbances  due  to  air  turbulence  [4], 

Impulsive  uncertainties  were  shown  to  be  better  described  by 
heavy-tailed  distributions,  such  as  the  symmetric  alpha-stable  (SaS) 
distributions  [5].  These  distributions  are  described  not  by  their  prob¬ 
ability  density  functions  (pdfs),  but  by  their  characteristic  functions 
(CFs).  They  are  of  the  form  (p(V)  -  e~aa \v\a+^v ,  where  a  is  the 
scaling  parameter,  /J  is  the  median,  v  is  the  spectral  variable,  and  the 
characteristic  exponent  a  detennines  the  type  of  distribution:  a  -  2 
implies  the  Gaussian  distribution,  and  Cf  =  1  implies  the  Cauchy 
distribution. 

Estimation  assuming  Cauchy  distributed  noises  has  shown  im¬ 
proved  perfonnance  over  Gaussian  estimators  when  faced  with  im¬ 
pulsive  noises.  For  estimating  the  direction  of  arrival  of  a  signal 
to  a  sensor  array  in  [6],  maximum  likelihood  estimators  designed 
assuming  Cauchy  distributed  noises  were  shown  to  exhibit  perfor¬ 
mance  very  close  to  the  Cramer-Rao  Bound  against  SaS  noises 
with  characteristic  exponents  1  <  a  <  2.  Similar  performance  was 
observed  in  various  applications,  including  processing  data  in  a  multi¬ 
user  communication  network  [7]  and  radar  glint  [8];  in  particular,  the 

This  work  was  partially  supported  by  Air  Force  Office  of  Scientific 
Research,  Award  No.  FA9550-09-1-0374,  and  by  the  United  States  -  Israel 
Binational  Science  Foundation,  Grant  2008040. 

J.  H.  Fernandez  is  with  the  department  of  Mechanical  and  Aerospace 
Engineering,  University  of  California,  Los  Angeles. 

Email:  jhf@seas.ucla.edu. 

J.  L.  Speyer  is  with  the  department  of  Mechanical  and  Aerospace  Engi¬ 
neering,  University  of  California,  Los  Angeles. 

Email:  speyer@seas.ucla.edu. 

M.  Idan  is  with  the  Faculty  of  Aerospace  Engineering,  Technion,  Haifa, 
Israel.  Email:  moshe.idan@technion.ac.il. 


a  parameter  for  the  in-phase  component  of  a  time  series  of  sea  clutter 
in  radar  in  [6]  was  calculated  to  he  a  =  1.7.  A  framework  based  on 

these  stable  distribution  models  was  developed  in  [9]  and  shown  to 
have  significant  improvements  in  performance  against  heavy  tailed 
noises. 

The  apparent  robustness  and  adaptability  of  the  Cauchy  probability 
model  motivated  the  derivation  of  a  sequential  estimator  for  linear 
scalar  systems  [10,  11],  and  subsequently  for  general  vector-state 
systems  driven  by  Cauchy  noise  [12, 13].  The  estimator  for  general 
vector-state  systems  suffers  from  severe  growth  in  numerical  com¬ 
plexity,  limiting  its  use  to  a  small  number  of  measurement  updates 
and  states.  The  aim  of  the  current  work  is  to  develop  an  efficient 
two-state  estimator  to  process  measurements  more  quickly,  and  to 
arrest  the  growth  in  complexity  in  order  to  implement  the  Cauchy 
estimator  for  an  arbitrary  number  of  time  steps. 

The  methodology  in  [12, 13]  is  based  on  finding  the  characteristic 
function  of  the  conditional  pdf  of  the  state  given  the  measurement 
history.  This  work  follows  that  same  procedure,  and  by  exploiting 
certain  relationships  for  the  two-state  structure,  we  can  greatly  reduce 
the  complexity  of  the  algorithm.  The  proposed  algorithm  was  derived 
inductively  by  working  out  the  first  three  measurement  updates,  and 
then  deducing  the  general  update  process.  In  this  paper,  we  present 
the  first  two  measurement  updates,  followed  by  the  general  recursion. 
It  has  been  checked  against  the  results  in  [12, 13], 

The  CF  is  expressed  as  a  sum  of  terms,  each  of  which  has  two 
components:  a  coefficient  function  denoted  by  G  and  an  exponential 
function  with  argument  E .  These  functions  are  shown  to  have  known 
structures  that  persist  across  measurement  updates,  and  parameters 
that  are  contained  in  a  set  of  fundamental  arrays.  The  essence  of  this 
paper  is  in  deriving  this  structure  and  populating  the  arrays,  which 
allows  for  a  drastic  reduction  in  the  complexity  of  the  algorithm.  We 

begin  by  presenting  the  problem  in  Section  II,  and  performing  the  first 
measurement  update  for  a  general  vector-state  system  in  Section  III. 
Then,  in  Section  IV,  we  assume  a  two-state  system  and  derive  the  pro¬ 
posed  estimator  structure  for  the  first  measurement  update,  followed 
by  the  first  time  propagation  step  in  Section  V.  Section  VI  derives 
the  proposed  estimator  structure  for  the  second  measurement  update, 
and  Section  VII  presents  the  general  measurement  update  recursion 
algorithm.  Section  VIII  discusses  the  finite  horizon  approximation  of 
the  full  infonnation  estimator  by  using  a  fixed  window  of  the  most 
recent  measurements.  Finally,  numerical  examples  are  presented  and 
discussed  in  Section  IX,  and  concluding  remarks  are  given  in  Section 
X. 

II.  Problem  Formulation 

Consider  a  discrete  time,  linear  system  described  by 

x(k  +  1)  =  <I>x(/c)  +  rw(k),  (la) 

z{k)  ~  Hx(k)  +  v(k),  (lb) 

where  X(k)  €  Rn  is  the  state  vector,  u(k)  is  a  scalar  deterministic 

input,  Z(k)  is  a  scalar  measurement,  and  w(k)  and  V(k)  are  scalar 

independent  Cauchy  distributed  process  and  measurement  noise  in¬ 
puts.  The  noise  inputs  have  medians  at  zero  and  scaling  parameters 
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of  P  and  y ,  respectively,  so  that  their  pdfs  are  given  by 

fw  w(k)  - 

fv  v(k)  = 

The  characteristic  functions  of  th 
(pwip) 

<pv(o ) 

where  a  is  the  scalar  spectral  variable. 

The  initial  condition  is  assumed  to  be  a  product  of  scalar,  inde¬ 
pendent  Cauchy  distributed  random  variables  denoted  by 

ft  a,/TT  ,,  , 

fx,  (x(l»  =  - 1 - -■  (4a) 

fel  (*/0)  -  X'( l))2  +  ct-i 

Its  characteristic  function  is  given  by 

cpXl  (y)=  e 


P/TT 

(2a) 

w\k)  +  pl  ’ 

y/n 

(2b) 

v\k)  +  yz ' 

ie  pdfs  are 

e-p\o\, 

(3a) 

e~vM, 

(3b) 

'=  1  cr,  Vi  +j*i(\')Vi 


<  p  ^ 

=  exp  —  a;  \v,\  +jX(V)'v  ,  (4b) 

/=  1 

where  V,  is  an  element  of  v  e  Rn.  The  algorithm’s  structure  is  greatly 
simplified  by  assuming  that  the  median  in  (4b)  is  X(l)  =  0,  i.e.,  the 
initial  condition  is  centered  at  the  origin.  To  preserve  the  generality 
of  the  initial  condition  (4b),  the  system  in  (1)  can  be  decomposed 
into  two  systems,  a  stochastic  variable  initialized  at  the  origin  and 
driven  by  W(k),  and  a  deterministic  system  initialized  at  X(l),  such 
that 

X(k)  -  X(k)  +  Xk),  (5a) 

z(k)  -  z(k )  +  2(k).  _  (5b) 

The  dynamics  and  measurement  equations  for  Xand  x  are  given  by 

X(k  +  1)  =  <SX(k)  +  W(k),  Zfk)  =  HX(/c)  +  v(k),  (6a) 

x(/c  +  1)  =  <t>x(k),  z(k)  =  Hx{k).  (6b) 

Then,  the  proposed  algorithm  can  be  applied  to  the  system  in  (6a), 
and  the  deterministic  part  in  (6b)  can  be  used  to  recover  the  state 
estimate  for  X(k)  from  the  estimate  of  X(k)  by  using  (5a).  Therefore, 
the  presentation  of  the  algorithm  will  assume  that  X(l)  =  0  in  (4b) 
without  loss  of  generality. 

Finally,  the  measurement  history  used  in  the  estimation  problem 
formulation  is  defined  as 

Z*  :=  {z(  1), . . . ,  z(k)}.  (7) 


systems  using  the  characteristic  function  of  the  conditional  pdf  were 
developed  and  presented  in  [1 1, 14],  An  algorithm  for  a  multivariate 
Cauchy  estimator  is  presented  in  [12, 13]  and  summarized  here. 

The  CF  of  the  initial  state  conditioned  on  the  first  measurement  is 
given  by  the  vector  integral 

poo 

<PU1W  =  fXl  izi  (X(l) \z(\))eiv  '  x^dx(  1).  (9) 


The  conditional  pdf  is  computed  from  the  joint  distribution  of  X(l) 
and  Z(l)  using  Baye’s  Theorem  [1]  as 


f><]  |z,(x(l)  |z(l))  —  fzi(z(1)) 

_  fzx  |X,  (Z(l)  I  x(l))fx!  (X(l)) 


Hx(l)  )fxL(x(l)) 


f  (Z(l)) 


(10) 


Zi 


Then,  (9)  can  be  expressed  as 

fbo 

<Pm(V)=  fZl(l(W  ^(Z®_HX(1)) 

—co  _ 

yv'x(i)  ,  in M 

x  fX]  (x(l))e  dX(1)-  fz,  (z(l))  ’  (11) 

where  cp^  ^  ^  (v)  is  the  characteristic  function  of  the  unnormalized  cpdf 

(ucpdf).  Note  that,  since  Z(l)  is  known,  fz,(Z(l))  is  a  constant;  since 
<Pmk=o  —  1,  then  cp  \  =f,(z(  1)). 

1|1  \/=0  z 

Using  the  dual  convolution  property  [15],  (p]  1 1  (]/)  in  (11)  can  be 

expressed  as  n  convolution  integrals  in  the  v  domain  between  the 
characteristic  functions  of  fv  (Z(l)  —  Hx(l))  and  fx ]  (X(l)).  The 

CF  of  fx,  (X ( 1 ))  is  given  in  (4b).  The  CF  of  f/(Z(  1)  —  Hx(l))  is 
denoted  cpv  (V)  and  is  given  by 


r°°  r°° 

cpviy)^  ■■■  fv(z(i)~  hjXi(\)) 

—oo  — CO  ,=  1 


xd  ?=lViXi(r,dxy\)...dxn(V).  (12) 


The  objective  is  to  derive  the  conditional  mean  estimator  for  this 
system.  To  do  this,  we  find  the  CF  of  the  pdf  of  the  state  at 
time  k  given  the  measurement  history  Zk,  denoted  as  Cpk\k(Y)-  The 
next  section  begins  with  the  CF  of  the  pdf  conditioned  on  a  single 
measurement. 


III.  First  Measurement  Update 

Begin  with  the  first  measurement  update  at  k  =  1  by  taking  a 
noisy  measurement  of  the  Cauchy  distributed  initial  state  as 

z(l)  =  Hx(l)  +  v(l).  (8) 

Here,  X(l)  and  1/(1)  are  the  Cauchy  random  variables.  For  a 
scalar  system,  i.e.  X(l)  e  R  ,  the  conditional  mean  estimator  has 

been  derived  and  is  presented  in  [10],  For  vector-state  systems,  an 
approach  based  on  determining  the  characteristic  function  (CF)  of  the 
conditional  pdf  (cpdf)  is  used.  Initial  results  for  estimation  of  scalar 


In  order  to  proceed,  we  need  some  assumptions  about  the  measure¬ 
ment  vector  H  ❖  [h\ . .  .  hn].  The  first  is  that  at  least  one  element  of 
H  is  nonzero,  i.e.  there  exists  an  /  such  that  hi/—  0.  This  assumption 
is  a  prerequisite  for  observability  of  the  state.  The  second  assumption 
is  that  this  nonzero  element  is  hn ,  which  has  no  effect  on  generality 
[16]. 


To  carry  out  the  integration  in  (12),  perform  the  change  of 
variables:  f  =  Z(l)  —  ’  /?,X;  to  write 


/=! 


*n(l) 
dxn(  1) 


hn 

dt 

\hn\  ' 


£ 


hjXi(  1) 


\ 


/=! 


(13a) 

(13b) 
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This  allows  us  to  manipulate  (12)  as 


Rewrite  (18)  using  (3b)  and  (4b)  as 


|CO  |CO 

<Pr(v)  —  ■  ■  ■  fv(£)X 


—  OO  — CO 


I  n- 1 

n-\  i=i  hix,XV) 

J  /=1  WXrtlHW,  TTn  _d4_dXl(l)  .  .  .  dxn-x{  1) 

□  |/Jn|n 

p° 

— CO 

□  □ 

100 100  J 

x  D  e  /=1  ""  dxi(l) . . .  dxn-ii.(l)n .  (14) 


—  CO  — OO 


Here,  the  left  parenthesis  equals  the  CF  of  fv  if  the  spectral  variable 
is  —  " .  The  right  parenthesis  is  a  product  of  Dirac  delta  functions, 

hn 

5(0,  giving  ^ 

e/*sz(D  t/vAnpi1"  (  h  \\ 

=  ~K  , ,  (”)5  Vl~r„v"  ■ 

(15! 

Using  <pxi  of  (4b)  and  <#/  of  (15),  we  can  express  the  first 
measurement  update  ucpdf  s  CF  using  the  dual  convolution  property 
[15]  as 


_  f°°  poo 

.  J !  J  00  =  ■  ■  ■  fXl  (X(i))fv  (2(1)  -  Hx(l))e/V  T*^dx 


—  OO  —  CO 

poo  poo 


(277)" 


=  (2") 

(277)"  |  hn 
n—l  C 


—  OO 

poo  poo 


cpx 1  (V  -  CT)  <3/  ( o)do 

-OO 

/CT"zcn  (  \ 

<Pxi (t/  -  a) e  **-  (pv  -On. 

hn 


Z—  OO  — OO 


hi 


\ 


7=1 


5  Oi-—On  dO]  .  .  .  dOn-\  dOn.  (16) 

tin 


Integrating  over  G\  . . .  On-\  is  simple  due  to  the  delta  fiinctions  in 
(16),  and  results  in  a  single  integral  over  the  scalar  On: 

1 

I  ,  (V)  - _ 

111  277  |  hn  | 


fbO 


( 


x  cpx  1  I/!  -/7i  t/n-1  -  hn-ihn,Vn- On 


1  17„ 

yCT"z(i)  (  \ 


hn  ’ 


x  e  hn  cpv  don 


\ 


277  [  hn  | 


r°°  ( 

,  ,T  , 

<PXi  H  t-  & hn  z(  i) 

n  n 


( 


<Pv  Th 


dOn. 


(17) 


Finally,  we  have  a  convolution  integral  involving  the  CF  of  the  pdf  of 
the  state,  and  the  CF  of  the  measurement.  This  result  indicates  that 
the  ucpdf  s  CF  for  a  system  of  arbitrary  (finite)  order  conditioned 
on  a  scalar  measurement  can  be  determined  from  this  single,  scalar 
convolution  integral.  A  simple  change  of  scalar  variables  (7  =  Or/hn 
and  don  —  do  |  hn  |  gives  the  final  fonn 

j  I” 

1  =  277  cpx' (,/  _  hTc7)  Vv  (-o)  (Jzm°do.  (18) 


1  titOO 


pco 


I  ( 


/>^.i 


\ 


exp  _  p  \p  -a I  +jz(l)o  do,  (19a) 
ATT  j—\ 


p  ~a  E  H1  for  (  1,  ...,77  p  =  v,  (19b) 

G  {  }  "+i 

u  =;J^forf,e{\,...,n}  pn+i  =  0,  (19c) 


where,  £  is  the  f,  row  of  the  77-dimensional  identity  matrix.  Note 
that  the  p  are  scalars  linear  m  V,  i.e.  inner  products  ot  v  with  given 


vectors.  The  solution  to  (19)  is  given  in  [12, 13]  as 


2  TT  /=1 
n+1 


□ 


n  ^  n 

-X  'V  x  p  |p  —  p,|  +yz( i)py 

=*  277 


□ 


/=  1 


=  1 
/=/ 


°  L .  (\'J  □ 

X  JZ(l)  +  pi  +  p  sgn(p  -  V)  D 

n  =1 

/=/ 


B-i 


□ 


□  -i 


t^?rl-l 


_  l  /z(  1)  _  Pi  +  P  sgn  (p  -  p,) 
=1 
/=/ 


(20) 


Thus,  the  measurement  update  process  produces  a  sum  of  exponential 
tenns  with  coefficients  in  the  brackets.  For  the  two  state  system,  as 
will  be  shown,  the  bracket  tenn  can  be  reduced  to  a  simple  form 
that  is  a  polynomial  of  sign  fiinctions.  Next,  we  present  a  structure 
for  expressing  (20)  for  the  two-state  system  that  can  be  extended  to 
subsequent  measurement  updates. 


IV.  Two-State  Estimator  Structure  for  the  First 
Measurement  Update 

In  the  previous  section  we  considered  a  system  with  a  state  vector 
of  general  order  77,  and  an  algorithm  for  the  general  vector-state 
system  is  presented  in  [12, 13].  However,  that  algorithm  suffers  from 
a  very  aggressive  growth  in  computational  complexity  with  each  new 
measurement  update.  For  a  second  order  system  there  are  certain  pat¬ 
terns  and  algebraic  relationships  that  allow  for  significant  reductions 
in  numerical  complexity  and  allow  the  estimator  to  run  effectively 
over  a  large  number  of  measurement  updates.  There  are  two  main 
aspects  to  this  simplification:  a  way  of  expressing  and  indexing 
vectors  that  multiply  (as  inner  products)  the  spectral  vector  v  in  the 
exponential  argument  in  (20);  and  a  set  of  algebraic  relations  that 
can  be  used  to  simplify  the  coefficients  of  the  exponential  fiinctions. 
Both  of  these  aspects  are  addressed  here  for  the  first  measurement 
update  in  (20)  presented  above.  In  Section  VI  the  second  update  is 
presented,  indicating  by  induction  the  general  measurement  update 
and  time  propagation  recursions  given  in  Section  VII. 


A.  Exponential  Argument 

Consider  the  arguments  of  the  absolute  value  terms  in  (20).  The  p 
scalars  are  defined  in  (19c)  as  scaled  inner  products  of  v  with  vectors 
we  call  fundamental  directions.  For  the  first  measurement  update, 
these  fundamental  directions  are  the  rows  of  the  77X77  identity  matrix. 
For  the  two-state  system,  the  set  of  fundamental  directions  from  the 
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r  [  r  I 

initial  condition  is  Buo  =  p.  10 

tj  =  o  i  .  In  the  subscript  of 

Bi|o,  the  first  element  denotes  the  time  step,  and  the  second  element 
denotes  the  number  of  measurements  that  have  been  processed. 

Inner  products  are  linear  operations,  and  hence  a  difference  of  inner 

products  with  a  given  vector  is  also  an  inner  product.  This  new  inner 
product  introduces  a  new  fundamental  direction.  This  new  direction 


is  the  sam 
denote  its 


cfor  any  real,  2  x  2  Bgo.  Using  a  superscript  .on  B,  in 
/,  row,  apply  this  notation  to  me  definitions  in  (19c) 


to 

to 


Using  (25)  and  the  definitions  for  p  from  (19b)  in  (20)  yields 

1  lll0/)  ( 

GinOOexp  -<*2  E2HT  “  HAv 

(fiHT)(E2HT)  \ 

E\V  E\V 

(  +VZ(1)EilTT 

2  T _ HAv _ 
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'  fJm 


B 

110^ 


BT\ov 

m  i 


mT  T  m 

_  HBj|0  Bwv-  HBlipBuov 


Hfil|0Sl|0H 


_  H  v 


(Bm  HT )(B  HU 
no  no 


(21) 


The  key  here  is  to  recognize  that  the  term  in  parenthesis  in  the 

numerator  of  (21)  is  a  matrix  minus  its  own  transpose,  which  implies 

r  I 

that  it  is  antisymmetric,  i.e.,  that  BmlB  —  B  1  Bm  =  0  C 

no  o  no  no  -c  o 

for  some  C  E  R.  This  constant  C  can  be  computed  and  pulled 
out  of  the  matrix,  which  allows  us  to  express  any  two-dimensional 

r  I 

antisymmetric  matrix  as  CA  where  A  =  0  1  and  C  can  be 

J  -10 
verified  to  be 


mT 


c  =  -B  A B 

no  no 


(22) 


Hence,  we  can  write  (21)  as 


,  mT 

-B  n°ABHo 
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(e;'|0H^B 


tXr  ti1) 


no 


■  HAi/. 


(23) 


This  produces  the  new  fundamental  direction  is  HA,  scaled  by  the 
tenn  in  parenthesis.  For  the  relationship  in  (23)  to  hold,  //  /=  yUm 
and 

neither  //  nor  /Jm  can  equal  zero.  In  (19c)  we  also  defined  an  extra 
constant  yL/3  =  0.  This  implies  that  the  old  fundamental  directions 
are  retained  in  the  measurement  updated  cpdf  s  CF  for  terms  in  the 
exponent  of  (20)  that  involve  yL/3.  Therefore,  the  set  of  fundamental 
directions  for  the  first  measurement  update,  denoted  by  B|  1 1,  is  given 

by 


Bm 


£1  ‘ 

=  □  B  □. 

HA 


(24) 


In  (20),  we  use  (23)  to  express  // 2  —  // 1  as 


/I2-1U1 
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e-tf-  e- hl 

2  1 
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HAi/ 
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\ 


2  £  HT 
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_ Y. _  IT  v  - -  IT  Aw  1  j  Z(l)  C 

x  exp  -  |£  Ht|  I  !  I  -  I E  HT|  \HAV\  J  E  HT11! 
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111(14)  exp  (— aj  |Eii/|  -  cr2  I E2t/| ) .  (26) 


Notice  that  each  of  the  three  tenns  involves  only  two  of  the  three 
fundamental  directions. 

The  efficiency  of  the  proposed  two-state  estimator  is  achieved  by 
both  keeping  track  of  which  directions  are  used  in  each  term,  as  well 
as  the  scalar  coefficients  that  multiply  the  absolute  value  functions 
and  the  scalar  coefficients  in  the  imaginary  part  of  the  argument  of 
the  exponential.  The  most  important  of  these  is  an  array  of  integers 
where  the  elements  of  each  row  correspond  to  the  rows  of  Bm  that 
appear  in  the  exponential  argument.  This  array  is  denoted  by  M(  1 1 1). 

The  exponential  argument  of  the  term  corresponding  to  /  =  3 
in  (26)  is  exactly  the  same  as  the  initial  condition.  In  fact,  the  only 

difference  between  the  initial  condition  and  this  term  is  the  coefficient 

Gm-  This  is  due  to  the  cancellations  that  occur  in  general  in  (26), 
so  that  one  term  produced  from  the  convolution  will  always  have  the 
same  exponential  argument  with  a  different  coefficient  function.  This 
tenn  is  referred  to  as  the  old  term.  It  will  be  shown  later  that  the 
old  exponential  arguments  persist  across  measurement  updates,  and 
therefore  it  is  useful  to  order  the  tenns  with  the  old  fundamental 
directions  first,  so  that  the  last  tenn  in  (26)  moves  to  the  top. 
Consequently,  M (1 1 1)  is  constructed  as 

0.  3 

1  2 

M(l\l)  =  0\  3D.  (27) 

2  3 


The  other  two  terms,  conesponding  to  /  —  1  and  /  =  2  in  (26), 
are  called  the  intermediate  new  terms',  we  say  intermediate  because 
in  all  subsequent  measurement  updates,  as  will  be  shown,  they 
will  combine  with  other  tenns  with  the  same  exponential  argument, 
and  new  because  they  involve  the  new  fundamental  directions  just 
generated  during  the  measurement  update. 

Following  this  new  ordering,  we  define  two  additional  arrays 
P  (1 1 1)  and  Z(1 1 1)  whose  elements  conespond  to  the  coefficients 
of  the  absolute  value  functions  in  the  exponents  and  the  coefficients 

th 

in  the  imaginaiy  part,  respectively;  hence,  the  /  rows  of  P(1  |l) 
and  Z(1 1 1)  are  related  to  the  Ith  term  in  For  the  measurement 


(£!HT)(£2HT) 


updated  ordering  of  terms  used  to  define  M(\  |  1)  in  (27),  these  arrays 


5 


are  given  by 

,S!  a, 

□  v 

P(i|i)=  nl  PT 

-  Y 

IW| 


□ 

a2 

a2  U 

|EiHT|D  , 

Ofl  U 

Wp\ 


□ 

□  I, 

Z(l|l)=  e4&- 

□  b. 


E2HT 


□ 

°0D 

:  ■ 

o" 


(28) 


Tliey  have  the  same  dimension  as  1W(1|1),  so  that  an  element  of 
P  (1 1 1)  or  Z(  1 1 1)  goes  with  the  fundamental  direction  indexed  by 
the  corresponding  element  of  !W(l|l). 

Finally,  define  a  vector  array  of  integers  L  m  with  as  many 

elements  as  rows  of  /W(l|l).  Each  element  of  Lm  indicates  the 
number  of  fundamental  directions  in  that  tern,  i.e.,  the  width  of  the 
corresponding  row  of  M(\  |  1).  For  the  first  measurement  update,  Lm 
is  given  by 

f  1T 

Lm  —  2  2  2  ■  (29) 


This  final  array  is  unnecessary  in  the  first  measurement  update,  but 
will  become  essential  later  when  different  terms  involve  different 
numbers  of  fundamental  directions.  Finally,  define  the  number  of 

terms  as  N m.  Clearly,  N m  =  3,  the  same  as  the  number  of  rows 
of  1W(1|1),  P(l|l),  Z(l|l),  and  Lm. 

Using  (24)  (27)  (28)  (29)  and  the  definition  for  N\\\  -  3,  the 
exponential  argument  for  the  first  measurement  update  ucpdf  s  CF 
given  in  (20)  can  be  expressed  as 

U  <  2  X 


filiOO--  Pi  BU1V  +j  Z/B1U  v, 

=  1  =1 

/  e  (30) 


that  is  indexed  by  the  integer  M It  is  assumed  that  P  ,  Z,  and  M 
inherit  their  time  index  from  the  associated  fundamental  directions 
from  B. 


V.  First  Time  Propagation 


In  this  section  the  characteristic  function  of  the  cpdf  is  propagated 

that  at  an  arbitrary  time 

k(X(k)),  its  CF  is  given  by 
Vkikffi),  and  that  the  process  noise  CF  is  given  as  in  (3a).  The  pdf 

of  X(k  + 1)  is  determined  using  the  linear  transformation  of  the  joint 
distribution  fxk,w(Xk,  Wk).  The  details  of  this  derivation  can  be 
found  in  [12, 13],  and  the  formula  for  the  characteristic  function  of 
the  propagated  cpdf  is  given  by 

T  T 

(pk+\\k{V)  =  <Pk\k(<£>  WVwtF  V).  (33) 


Note  that  the  derivation  of  (33)  in  [12,13]  requires  that  <t>  be 
invertible. 

Applying  the  time  propagation  equation  (33)  to  the  characteristic 
function  of  the  first  measurement  update  given  in  (20)  yields 

(p2 |iM  =  <P_,  M  (oTi/)<pw(rTi/) 

c p  (O’t /)(p  (Ttv)  cp  ( V ) 


z1 

l(z(l))  f  (*(  1)) 

Then,  the  ucpdf  s  characteristic  function  for  the  propagated  state  is 
given  by 


B.  Polynomial  Coefficient 

Using  (25),  (19b),  and  (19c),  the  bracketed  terms 


G' 

Hi 


in  (20)  are 


1  211  O') 


-  <P 

=  "&  T  tf|l(® 


1((t'1V)  ■  e  ^1  r 

T  T 

V)  _  /3  r  v 


of  the  same  form,  and  each  involves  two  sign  functions,  denoted 
Si  and  S2,  with  the  same  arguments  as  the  absolute  values  in 
the  corresponding  exponential  parts.  Functions  of  this  form  can  be 
reduced  to  a  four  parameter  polynomial  of  these  sign  functions  as 


ai+biSiS2+JCjS\  +jdiS 2.  These  relationships  are  given  in  Appendix 
A;  for  k  —  1  in  particular,  Result  3  in  Appendix  A  can  be  used  to 


obtain  the  four  parameter  polynomial  by  assuming  that  am  =  1 

/ 

and  b,m  =  c™  —  d™  ~  0.  The  sign  functions  involve  the  same 
fundamental  directions  as  the  exponential  argument,  so  denote  a  final 
array  G(l|l)  with  the  same  number  of  rows  as  /W(l|l)  and  width 

four,  and  let  each  row  contain  the  parameters  fog  the  polynomial,  as 

3\  h  c,  di 

G(l|l)  =  ca2  b2  Co  d2°.  (31) 

a>  b2  Co  d2 


Note  that  the  same  reordering  used  to  fonn  A//  (1 1 1)  must  be  used 
here,  so  that  the  top  row  contains  the  parameters  for  the  old  tern’s 
coefficient. 

Using  the  parameters  in  (31),  along  with  (24)  and  (27),  the 
coefficient  function  G|n  for  the  first  update  ucpdf  s  CF  given  in 
(20)  can  be  expressed  as  a  polynomial 

Gill  O')  ~  a'  +  L>/Sgn  B^'(V  sgn  q  v  + 


jCi  Sgn  B 


Ull 


111 


jdi  sgn 


"111 


(32) 


Remark  1:  In  the  notation,  for  P ,  Z,  and  M  the  subscripts  denote 
the  row  (i.e.  which  term  in  the  sum)  and  the  superscripts  denote  the 


i=l 


m(® 

□ 

□ 


Mj 


T 


V)  ■  exp  □  —  Pi  6,1)0  V  -|8  r  V 

/=! 


=1 

C\"  \  \ 

+JA-  2  z  BM‘  .  (35) 

7  Hi 
=i 

The  changes  in  <p2  \  \  (V)  from  q>{  1 1  (v)  are  a  new  element  in  the  sum 
of  the  absolute  value  terns,  and  a  linear  transformation  on  V.  The 
time  propagation  step  adds  no  new  terms  to  the  sum,  hence  No\\  - 
N\\\.  Moreover,  since  the  process  noise  has  a  zero  median,  the  time 
propagation  has  no  effect  on  the  complex  part  of  the  exponential 
argument,  so  that  Z(2|  1)  =  Z(1 1 1).  Finally,  there  is  no  effect  on  the 
parameters  of  the  coefficient  functions  G,  so  that  G(2 1 1)  =  G(l|l). 
The  time  propagated  _ucpdfs  characteristic  function  can  be  restated 
in  the  same  form  as  cp i.e., 

tp  (V)  —  ^  G'  (Y)  ■  e^7b(y\  (36a) 

2|1  2|1 
/=! 

where  , 

/  ^  M'  (  ? 

G2|i (V-)  =  a,-  +  L,  sgn  B1}(V  Sgn  E^'V 

+JCi  sgn  BfM'V  +  jdi  sgn  B^{  V  ,  (36b) 


column  (i.e.,  where  in  the  exponential  argument  it  appears).  Only 
B,  and  N  retain  the  time  dependence  in  their  subscripts.  The 
superscript  for  B  denotes  the  row,  i.e.,  the  fundamental  direction, 


fou  (  iV»  ^ 

i  M.  *  D  M. 

£2|l(V)  =  “  Pi  B2\lV  +J  Z'B2\'  V>  (36c) 
=1  =1 
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Z(2 1 1)  =  Z(1 1 1), 


Using  (36c)  we  define 


p3 


P (2 1 1)  =  □P(l|l) 


a\ 

„  — 


□  / 

/3D  =  MHiHT| 

ft  ~  ■*- 


a2 


I 


S2|i=  SW® 


E#T 
□  e2o 


Cf2 

\E0\ 

Qfi 

l#2HT| 


M(  2 1 1)  =  d/W(l|l) 

Z_2 1 1  —  l-l\l  +  ^1^  =  L3L  , 


irT  □ 

1  2  4 

4d  =  D1  3  4°, 
4  2  3  4 

□  □  □  □ 


|8i 

□  , 

P 


The  process  noise  introduces  a  third  absolute  value  term,  with  a  new 
fundamental  direction,  into  all  of  the  CF’s  exponential  arguments. 
The  next  section  deals  with  performing  the  next  measurement  update 
to  this  propagated  structure. 


VI.  Second  Measurement  Update 


/#■=  G2|l('/-H1CT) 


I 


(36d) 


X  exp  £2h(v  -  h  a)  -  y\o\  +  jaz( 2) 
211 

—  00 

□ 

n  +«u  m  t 

X  exp  -  Pi  B  '(I/-H Tcr)  —  /  |cr| 


□ 


=  1 


211 

\ 


(  2  .y. 

+j  Z,-B2\i  (y  -  HTa)  +joz(2)  da.  (39) 
=1 

In  order  to  solve  (39),  we  need  to  rewrite  it  in  a  manner  similar  to 
(19)  using  p  and  p  substitutions.  Begin  as  in  the  first  measurement 
update  by  defining  constants  p  obtained  by  expressing 
M  t  /  T 

B2|1V-H  iP~0),  (40a) 

and  thus 

Bm, 


P  -  2|l 


fv 


~U! - 


(40b) 


Next,  we  rewrite  the  argument  of  the  exponential  of  (39)  in  terms  of 
these  p  s  and  a  as 


The  second  measurement  update  process  involves  finding-the 
characteristic  function  of  the  unnonnalized  conditional  pdf,  i.e.,  <j 0  . 


The  formula  used  for  this  second  measurement  update  is  the  same 
as  the  first  update,  given  in  (18),  and  is  applied  to  (36): 


1  2 1 2 


00 


1  °° 

2 -r  r2|1(v  —  HTo)cpv  {—o)daz(2,da 

—  00 

-1  00  _ 

cp  (v  —  HTcr)  e  _  K 1 CT 1  e/CTZ(2)  do. 


— 00 


(37) 


Since  (p2^  is  a  sum  of  A/2 1 1  =3  terms,  the  measurement  update 
process  is  to  solve  the  convolution  integral  Ni\i  times,  once  for  each 
tenn.  Substituting  (36a)  into  (37),  we  get 


1  t°°  C^» 

277  ^2\\(y  -  HTCT) 

-CO  '=!  | 

x  exp  E2il(v  -  h'o)  -  y  I  — cr|  +joz(2)  da.  (38) 


Interchanging  the  integration  and  summation  operations  produces 
A/2|i  convolution  integrals.  As  with  the  first  measurement  update, 
each  convolution  will  produce  an  old  tenn  as  well  as  new  terms, 
called  intermediate  terms.  Since  many  of  these  tenns  have  the  same 
exponential  arguments,  they  can  be  combined  to  reduce  the  total 
number  of  terms  in  the  CF’s  sum. 

Therefore,  we  begin  with  an  arbitrary  Ith  convolution  in  (38). 


P' 


=1 


B2]l  ^ 


I P  -o\-y\-o\ 


j(z]BMil 


ZB 


( 


21 1 
f  1  T 


2|1 


V 

+ j  z(2)  -  z'bm '  HT  -  Z2B^  ht 
,  '  211  '  2|1 
,(  ,  Ml  2  M- 

—  j  ~Z-  i  1 1  ^9  1 1  ^ 


z|  1  z"/  lj2  1 1 

kn\h-1^ 


p  Ip  -  o\  +j02o, 


=1 


(41a) 


where  the  complex  part  multiplying  v  does  not  depend  on  a 
and  comes  out  of  the  convolution.  The  parameter  definitions  are 


t-i  11  " 


=  P4  =  0, 


1  M 1 


2  M2-  t 


02  =  z(2)  -  Z, fi.2|IH  —  Z,B2!i  H 


-’211  ^'I-’2|l 

and  the  ^^  +  1  =  3  +  1  fa  4  constants  called  p  are  given  by 


(41b) 


B 


efl,..., l-2|i  =3-7. 

211  ie{l,...,N2\i  =3} 

Y  f,  -  L‘  +1=4. 

2 1 1 


(41c) 


This  is  the  same  procedure  used  in  the  first  measurement  update 
except  that,  due  to  the  new  absolute  value  term  in  the  exponential 
argument  introduced  in  the  time  propagation  step,  there  are  now  four 
p  constants  instead  of  three.  Moreover,  there  are  three  fundamental 
directions  in  the  exponential  argument,  instead  of  just  two. 

Let’s  turn  our  attention  to  the  coefficients  of  the  exponents.  The 
coefficient  functions  for  the  three  terms  in  (p2^  are  all  of  the  form 


jCi  sgn  B^'  V 


h  jdi  sgn  B^'V  .  (42) 
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Hence  the  same  manipulation  used  in  (40b)  to  fonn  the  /JS  can  be 
used  here  to  get 

^  m'  x  m2  x 

G(|i(V  -  H  T(jj  =  a,  + A  B2|fH  .  B2|'  h 

sgn 

x  ^  1  (  sgn  (JJi  -  O)  +  Sgn  (JJ2  ~  O)  —  1 
2 

( 

+jCi  sgn  sgn  (P\  ~  O) 

+ jdi  sgn  sgn  (^2  -O),  (43) 


where  the  identity  sgn^i  —  O)  sgn  (p2  —  O)  — 

2  ( sgn  (p  i  —  cr)  +  sgn  <JJ2  —  a))  —  1  is  used. 

In  order  to  use  the  same  integration  method  as  in  the  first  mea¬ 
surement  update,  it  is  necessary  for  the  coefficient  of  the  exponential 

tenn  to  be  constant  within  each  subdomain  of  integration  (u  ,  p  +i). 
Since  these  coefficients  are  polynomials  of  sums  of  sign  functions. 


they  are  clearly  constant  in  these  regions.  Now  define  the  following 
constants, 


3i  —  a,, 

—  _  ^  M 1  x 

Pi  -C/ sgn  B2|1h 

A  =  l. 


bi  =  bi  sgn  B™\ HT  ■  BMa[  Ht 
A?  a  di  sgn  bM^HT  , 

A  =  1. 


(44) 

Our  integration  method  also  requires  that  the  coefficient  involve 
sums  over  the  same  sign  functions  as  the  exponential  argument,  which 


has  two  more  absolute  value  functions  than  the  coefficient  has  sign 
functions.  To  do  this,  introduce  two  more  sign  functions  into  the 
sum  in  (43)  by  multiplying  them  by  constants  A  =  A  =  0  and 
A  =  At  =  0.  Using  these,  the  coefficient  in  the  convolution  can  be 
written  with  sums  over  the  same  jJ  as  in  (41a): 


□  (  V,  □ 

■i  V  •Yl”  V 

/  T  1.  V 

G2|i(v-H  cr)  =  a,  +  A/n2  P  sgn (p  -  o)  -  1D 


(■V 


+j  A  sgn  (A  -  a) 
=i 


\ 


\ 

\ 


G2|1  a  sgn  (a  -  cr),  p  Sgn  (A  -  cr) 

=1  (  =* 

G2|l  A  sgn  (A  -cr)  .  (45) 

=l 


Since  the  only  difference  between  the  summations  in  (45)  are  the 
scalar  p  and  p  constants,  we  can  simplify  the  notation  and  use  a 
shorthand  p  in  (45)  to  represent  all  the  summations.  This  notation 
will  become  more  useful  in  subsequent  measurement  updates  as  the 
coefficient  polynomials  become  more  complex. 

Using  these  substitutions,  each  of  the  /  €  {1, . . . ,  N2\\}  convolu¬ 
tion  integrals  in  (38)  can  be  written  as 

^  iy(M 

G,|,  A  sgn  (A  -  cr) 

—  CO  =1 

(  \ 

Xexp  -  #  p  |p  -cr|+ye'cr  ^ 


ment  update  from  [12, 13]  is 


,  _  _ □  1 


□ 


exp  '  /  □ 

□  “  p  |p  -  Pm  I  +j62IJmn 

m=  1  =1 

□  □  ^  m  .  □ 

L'  +1 

n  .  21 1 

Ua^2|lD+An+  p  Sgn  (A  —  Am)  ^ 


□ 


=  1 
/=m 


jO'  +  pn 

2 

□ 


'211 +  1 

/=m 

sgn  (A 

Pm) 

□  □ 

?|1+1 

□ 

),  P 

sgn  (p 

-  A m)0  -g 

=  1 

_l=m _ 

y 

■  (47) 


jei2-pm+  Sgn(/J  ~Pm)  ^ 


=i 

/=m 


The  convolution  produces  Z_'  +  1  =  4  terms,  indexed  here  by  777, 

for  each  of  the  /  terms  in  the  sum  in  The  first  three  terns 
are  called  the  intermediate  new  terms  because  some  of  the  terms 
from  different  convolutions  have  the  same  exponential  arguments  and 
can  be  combined  by  summing  their  coefficient  functions  together, 
as  will  be  shown.  The  last  tenn  produced  by  the  convolution  that 
corresponds  to  777  =  4  is  the  old  term  because,  since  =  0, 

it  has  the  same  exponential  argument  as  the  Ith  term  in  (p  ,  1 1  that 
was  convolved.  Simplifications  of  these  terms  and  their  coefficient 
functions  are  addressed  next. 


A.  Second  Measurement  Update  -  New  Intermediate  Terns 


The  terms  corresponding  to  777  e  {1, 2,  3}  are  called  the  mtermedi- 
ate  new  terms.  These  always  involve  only  two  fundamental  directions: 
one  is  from  62|l  and  the  other  is  the  new  direction  HA  as  in  the  first 
measurement  update.  The  formation  of  the  new  fundamental  direction 
is  what  recovers  the  structure  involving  two  fundamental  directions, 
as  will  be  shown  here. 

Until  the  intermediate  new  terms  are  combined  into  the  final 
structure  for  the  second  measurement  update,  it  is  useful  to  denote  the 
exponential  argument  in  terms  of  both  indices  /  and  777.  Hence,  for  the 
intermediate  terms,  denote  the  coefficient  and  exponential  arguments 
as  Gl  m  and  E'  m,  respectively. 


=l 


2 1 2 


2 1 2 


For  the  Ith  term,  use  the  definitions  in  (40b),  (41b),  and  (41c)  to 
The  solution  to  the  Ith  convolution  integral  for  the  second  measure-  rewrite  the  exponential  argument  of  (47)  as 
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°2|1  H 


It  II  + 


=  1 
/=m 
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2\  1  H 
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HT^2|1 
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_  dM  i  \  D  M  m 
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B2|l  H 

^’p  BM‘  ABMi  ^  \HAv\ 


=i 


211 
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B2|f  H7 

z(2)BMfnv-}i  Z  KBm,  B^Tt 

211  >  211  A  211  HA|/ 


+J 


C  C 

2  5 


1  5 


M(2\2)=  n  3 


□D  □' 

5  n=  n  2  5 


(50) 


same  fundamental  directions,  the  coefficients  for  the  absolute  value 
functions  are  different,  as  will  be  shown  next. 

_  Now  we  construct  an  array  of  coefficients  for  the  absolute  value 
functions  m  the  exponents.  From  (48),  a  pattern  emerges  tor  con¬ 
structing  these  coefficients.  They  are  stored  in  the  array  P(2|2)  that 
is  given  by 

P(2|2)  = 


3 — 1 — 

a2  | det O  |  +13  ElO  1  Ar 

u 

D  |E'°THT| 

|E,otht| 

J 

Y 

0T|  1  det  O  1  +  [3  E2®TAT 

0 

n 

I  |  £  <!>:I|T| 

J- 

a  k 

_  |HAd>THT| 
r  irirl 


|  E  OtHt| 
Y  det  ®  +  (3 

I  I  HA<D 


Ar 


|  HA<J>THT  | 

a,  Pi®T  Ar  +  02  E2<t>TAr 

— - W - 


(5D 


BMm  H1' 

This  final  form  for  the  new  intermedfile  tenn’s  exponential  argument 

involves  only  two  fundamental  directions:  one  that  is  a  row  of  6211 

that  corresponds  to  the  Mm  integers  for  m  €  { 1, . . . ,  L1  =  3},  and 
i  2|  1 

the  new  vector  HA.  Moreover,  the  denominators  of  both  terms  are 
equal.  Therefore,  we  define  the  set  of  updated  fundamental  directions 
as  6212  and  construct  it  by  appending  HA  to  the  bottom  of  6211  as 

T  d 
Ei<Dt 

r  1  cd  Ez<1>  m 

S2|2=  =  0HA©ts-  (49) 

□  rT  □ 

HA 


B.  Second  Measurement  Update  -  Recursive  Structure  of  the  Arrays 
Denote  an  array  of  integers  M(2|  2)  where  the  elements  of  a  given 

row  index  the  rows  of  6)12  that  appear  in  the  corresponding  tenn.  The 
rows  of  the  old  terms  will  be  unchanged,  and  a  set  of  new  rows  will 

be  appended  to  the  bottom,  each  of  width  two.  Since  each  convolution 

produces  terns  involving  one  of  the  old  tenn’s  directions,  the  new 
rows  of  1W(2|2)  will  contain  all  possible  combinations  of  rows  of 

St  II  with  the  new  row  in  6212  as 

C  C 

1  2  4 
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Next,  denote  the  new  array  of  coefficients  for  the  imaginary  part  of 
the  exponential  argument  as  Z(2|2).  Since  the  time  propagation  step 

has  no  effect  on  the  imaginary  part  of  the  exponential  argument, 

this  array  always  has  width  two,  involving  only  the  original  two 
directions,  and  it  has  the  same  number  oil rows  as  M(2|2).  Based 

on  the  same  manipulations  used  to  obtain  the  elements  of  P(2|2), 
Z(2|2)  is  given  by 


Z(2 1 2)  = 


I 

°E^ 

□  "  Z(2) 

g  E2OTHT 

□  Z(2) 

□  HAO1  H r 


Z(2|  1) 


0 

0 

_  Z(  1)  det  O 
HAOTHT 


(52) 
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D[  4  5  5  @  3  5 

C  4  5  T3;'  R  4  5  C 

4  5  □  4  5  □ 

4  5 

Each  of  the  three  convolutions  in  the  second  measurement  update 
produced  four  terms,  the  old  tenn  and  three  new  intennediate  ones, 
for  a  total  of  nine  new  terms.  However,  from  (50)  it  is  clear 
that  we  only  have  six  distinct  new  terms  with  different  pairs  of 
fundamental  directions.  This  is  because  three  pairs  of  terms,  from 
different  convolutions,  have  the  same  exponential  arguments  and  can 
be  combined  into  one  tenn  by  summing  their  polynomial  coefficients. 


The  usefulness  of  Z_2|2  is  more  apparent  in  this  measurement 
update,  since  older  terns  involve  more  fundamental  directions.  It  is 
formed  by  simply  appending  an  array  of  2s  of  length  six  to  Z_2|i  as 


— 2 12 


f .  rr  lx 

2  2  2  2  ^2 
f  211  1 

:333222222T. 


(53) 


C.  Second  Measurement  Update  -  New  Coefficients  G 
Consider  now  the  new  coefficient  functions  for  the  new  intermedi- 
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These  three  combined  pairs  produce  the  first  three  new  terms,  and 
thus  they  introduce  the  first  three  new  rows  of  M( 2|2).  The  last 
three  rows  of  M( 2 1 2)  are  due  to  the  time  propagation  step  and 
involves  the  r*  direction.  Although  these  last  three  tenns  have  the 


ate  terms  produced  by  the  Ith  term  of  (p  .  For  m  =  1  and  m  =  2, 
the  numerators  of  (47)  are  not  equal  and  hence  cannot  come  out  of 
the  bracket  tenn.  They  are  of  a  form  compatible  with  Result  3  in 
Appendix  A.  Denote  the  coefficient  of  the  mth  intermediate  term  as 


9 


Gi™.  Then,  the  numerators  for  m  =  1  are  given  by 


GSli  ±A 


=  a  +  b 


+  p  sgn(/V  —  /Vi) 
=2 

-  Di( 


'•J2-  ±A  + 


P  sgn  (p  -  fJi)  -  1D 


( 

+j  ±pi  +  P  sgn  QJ  -pi) 


1^.. 
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\ 


2-  1 


=  a,  +  6,(  ,  ±  A  +  Asgn(/v2  -Pi) 

( 

+7  ±Pi  +Pisgn(jj2  ~Pi) 

(  |  /V/-  -p 

“  a,'±fa; sgn  B"/H t  .  6,,;  H  sgn  (HAV) 

±y'C/sgn  B“'hT  +  jdi  sgn  6^-HT  sgn  (H  A  V)  . 

(54) 

^pse^.  taifni 

that  p  =  0,  the  new  term  obtained  for  m  =  3  corresponds  to  the 
coefficient 

1  1 


/,  3  □  \ 

G2|l  -0  +  ,  P  sg n(P  -P3)D 


□ 


=  1 

(  *«? 


i+bi  sgn  B2| i  HT  .  H  1  sgn  (p;  -  /V3)  sgn  (JJ2  -  /V3) 
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■^2 1 1 


+yc,sgn  B™l  Ht  Sgn  (Pi  -p3)  + 


( 


( 

yd/ sgn  B^HT  sgn  (p2-p3) 


-  a  i+bi  sgn  -Bm]aB2|i 
( 


sgn  (  ,  4  dmi  '1 


M'  M3.  T 

+yc,sgn  -B2|'AB2|1' 


_RA/fl  AB2(i 

D2|l 


+  jdi  sgn  — B^'AB2|i  sgn  (HAt/) .  (55) 

Substituting  these  into  the  bracket  tenn  in  (47),  and  noting  that 
the  real  parts  of  the  denominators  are  similar  to  the  real  part  of 
the  exponential  argument,  yields  a  form  that  is  compatible  with  the 

numerator  forms  given  in  Result  3  to. produce  coefficient  functions 
for  the  intermediate  terms  denote^  ’’  (V)  and  given  by 

(  .  \  (  _  \ 

(V)  =  ai.m  +  bi.m  sgn  V  sgn  V  + 

jCi,m  sgn  B^mV  +jdi  m  sgn  .  (56) 


It  is  necessary  to  use  Result  3  because  the  denominators  in  the 
bracket  term  in  (47)  form  two  sign  functions,  in  the  same  way 

that  the  new  exponential  arguments  (48)  involve  only  two  absolute 
value  terms.  Therefore,  polynomial  coefficients  for  all  of  the  new 


intermediate  terms  can  be  expressed  in  the  simple  four-parameter 
fonn  of  (56).  Note  that  the  two  fundamental  directions  involved  here 
are  the  same  as  in  the  argument  of  the  exponential.  Hence,  combining 

two  terms  with  the  same  exponential  arguments  involves  summing 
their  polynomial  coefficients  ’  ,  which  is  simply  summing  the 

U2|2 


corresponding  parameters. 

For  the  second  update,  three  pairs  of  the  nine  new  intermediate 
tenns  can  be  combined,  leaving  a  total  of  six  new  terms  in  addition 
to  the  three  old  tenns.  The  set  of  combined  parameters  for  the 


G(1 1 1),  producing  the  new  array  G(2 1 2),  which  has  four  columns  and 
the  same  number  of  rows  as  M(2|2).  Then,  the  coefficients  for  the 
new  terms  (i.e., those  corresponding  to  /  €  {4, ...  ,9})  are  given  by 

G212OO  =  a'  +  A;  sgn  B  v  sgn  ( ^  v 

+  jCi  sgn  v  B“y  v  +  jdi  sgn  v  B^y  v  ,  (57) 

where  a#,  bi,  Ci,  and  d,  are  the  elements  of  the  Ith  row  of  G(2 12). 


D.  Second  Measurement  Update  -  Old  Terms 

f  —  i  i  _ 

’  k\k-i  +  1  the  constants  {p,p  }  :=  p  —  0,  which 
implies  that  both  numerators  in  the  bracket  term  in  (47)  are  equal 
and  hence  can  be  pulled  out  of  the  brackets  as  a  common  factor. 
Therefore,  the  measurement  updated  old  tenn  is  the  same  as  the 
previous  old  term,  except  its  coefficient  is  multiplied  by  the  new 
bra^et  ^m,  g  +b  sgn  v  sgn  “ 
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(58) 


Since  the  new  bracket  term  in  (58)  has  three  sign  functions,  it  cannot 
be  manipulated  using  the  results  in  Appendix  A. 

This  implies  that  terms  that  have  been  created  in  previous  measure¬ 
ment  updates  retain  their  exponential  arguments  during  the  current 
measurement  update,  and  their  coefficients  are  multiplied  by  new 
bracket  tenns  as  in  (58),  with  one  additional  sign  function  than  the 

coefficient  had  in  the  previous  update.  The  difference  between  the 
current  time  step  and  the  time  step  when  the  term  was  originally 
created  is  called  the  age  of  the  term,  and  is  given  by  Lk\k  —  2. 
Although  this  complicated  structure  grows,  acquiring  a  new  bracket 
tenn  every  measurement  update,  the  stmeture  of  each  bracket  tenn 
overlaps  significantly  with  the  others,  and  thus  can  be  expressed 
efficiently. 

E.  Second  Measurement  Update  -  The  Characteristic  Function 
Using  (49)  (50)  (51)  (52)  (53),  and  denoting  A/2|2  «  9  for  the 

number  of  terms,  the  CF  of  the  uepdf  for  the  second  measurement 
update  is  given  by 


(  . 

<21200=  ^21200- 

exP  f-mOO 

(59a) 

i=  1 

where  , 

iyW 

<.*.  X 

x  ZiBMi  1/ 

Ei\2(Y)  =  ~  M 

(59b) 

pt  EP\iv  1  j 

212 

=1  =1 


and  the  coefficients  are  given  by  (57)  and  (58)  for  new  and  old  terms, 
respectively.  Note  that  the  first  three  terms  in  <p2|2  conespond  to  the 
oldJerms  and  have  the  same  exponential  parts  as  the  three  tenns 
in  i  2| j .  The  subsequent  six  tenns  are  new  and  involve  only  two 
fundamental  directions.  Although  not  used  her^,  it  will  be  useful  to 
denote  the  number  of  new  terms  in  <p2|2  as  ^2\2  ~  G  This  stmeture 
second  measurement  update  are  appended  to  the  bottom  of  G(2 1 1)  = 


repeats  itself  across  subsequent  measurement  updates,  and  the  next 
section  will  present  the  two-state  estimator’s  recursion  for  the  general 
kth  measurement  update. 
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VII.  Estimator  Recursion  for  the  ktb  Update  and 
Propagation 

This  section  presents  the  two-state  Cauchy  estimator  algorithm 
for  a  general  measurement  update  k.  This  algorithm  as  derived  by 
induction  based  on  a  study  of  the  first  three  measurement  updates, 
of  which  two  are  shown  here.  It  can  be  verified  to  be  a  special  case 
of  the  approach  developed  in  [12, 13].  We  first  address  the  update  at 
time  step  k ,  and  then  follow  with  the  time  propagation  step.  Finally, 
we  show  how  to  determine  the  minimum  variance  estimate  and  its 
error  variance. 


f,  —  L'f’  =  lj ' ' '  ’  L'k\k-i  ~r 

k\k-l  —  r  +  1,  -  -  - ,  Lk I Ar— 1  +  1 


Using  the  shorthand  p  as  in  the  second  measurement  update  (45), 
construct  p  ,  $,  p  ,  p  ,  and  p  constants  as  in  (40b),  (41b),  (41cj, 
and  (44),  respectively,  in  order  to  write  the  coefficients  in  the  1 
integral  in  (6 1 )  as 


A.  Measurement  Update 

We-begin  this  section  assuming  we  have  a  time  propagated  ucpdf  s 
CF  |  and  are  given  the  klb  measurement  Z(k).  Consider 

the  following  generalization  for  the  polynomial  coefficients  G'k^k_} 

produced  for  the  old  terns  in  the  second  measurement  update.  The 
form  in  (58)  suggests  that,  in  general,  the  time  propagated  coefficients 

will  have  the  form 

(Gkik-liv)  =  a,-  +  bi  sgn  Sgn  <[k-lV 
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0'  1  m}  T  _  1  M  ht  s 

k-r  =  Z(k  -  n  -  Z,  Bk  u  H  Z<  Bk-rjkrrH  j  (60c> 

and  k  —  r  is  the  time-step  where  the  term  involving  was 

k-r 

created.  The  arrays  M.L.P.B,  and  Z  for  the  general  update  will 
be  constructed  later  but  correspond  to  those  of  the  first  two  updates 
already  shown. 

Using  the  measurement  update  formula  (18)  with  the  correspond¬ 
ing  change  of  indices,  (pk\k  is  given  by 
WfyfM  °° 

**1*00=  ^  Gfc|*-10'-HTo)x 

/=  1 

-0°  g 

exp  ^  Uk\k-p  gM,  -  y\-a\  +jd'  ak  da.  (61) 

D  i  k\k-\  i 


Then,  the  entire  integral  can  be  written  compactly  as 


□  .  C 
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=1 

□  □ 
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=i 


The  solution  to  this  integral,  given  in  [12, 13],  is 


□  □ 

_  □  ^I>v*+1  /  □ 

n  —  '  exp  x 
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It  is  necessary  to  divide  the  domain  of  integration  into  regions  in 
which  the  polynomial  coefficient  is  a  constant  and  the  exponential 
argument  is  continuous.  For  this,  the  summations  that  appear  in 
(60)  must  involve  the  same  fundamental  directions  as  the  expo¬ 
nential  argument.  Since  the  top  bracket  tern  in  (60a)  is  the  four 
parameter  polynomial  formed  when  the  tern  was  created,  we  can 

define  p  and  pm  constants  as  in  (44),  where  p  —  p  =  0  for 
+1 

k\k~  1  ' 

Similarly,  the  summations  in  (60b),  from  the  rest  of  the  bracket 
terns  in  (60a),  can  be  written  using  constants  {r)p  defined  as 


x  n  .  ■  z_*i\-i+1 

jd'+Pm+  )’  P  Sgn  (p  ~  pm) 

□  =1 
/=m 
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j %-Pn 


,  (65a) 
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where 
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wheren<  =  —A/*  n*  noting  that  A/k|*-i  —  A/*— i  |*-i 

andA/yt_j  =  Nk-uk-\- The  left  column  of  this  new  block  has  three 
parts.  The  first  Ar  ^  elements  are  the  left  elements  of  all  the 
new  rows  of  the  previous  measurement  update,  i.e.  the  new  Nn  _  , 

k  Vk  1 

rows  of  M(k  —  1 1  /c  —  1).  The  next  element  corresponds  to  the  new 
fundamental  direction  from  the  previous  measurement  update,  i.e. 
the  last  row  of  S*— 1 1*— i.  The  remaining  Nk-i\k-i  rows  are  all  the 

same,  and  involve  the  new  time  propagation  direction  FT. 

The  growth  in  the  number  of  tenns,  which  is  given  as  a  linear 
dynamic  system  of  integers,  is  based  on  the  pattern  for  the  recursion 
of  M(k\k)  given  above.  The  number  of  terms  in  the  sum  in  <jp  , 

given  by  Nk\k,  is  determined  from  the  previous  number  of  terms 

Nk-i\k-i  and  the  previous  number  of  new  terms  Nk-i\k-i  by  the 
following  linear  relationship: 

r  I  r  lr  I  r  I 

Nk\k  _  2  1  Nk_Uk_l  +  1 

N  Nn 

^ k  1  1  k—  1  |/c—  1  1 

The  recursions  for  the  fundamental  arrays  P  (k  \  k)  and  Z(k\k)  were 
derived  by  induction  from  a  study  of  the  first  three  measurement 

updates.  As  in  the  first  two  measurement  updates,  the  denominators 
of  the  elements  of  the  /1h  row  of  Z(k\k)  are  equal.  Therefore,  it  is 
useful  to  denote  the  diagonal  matrix  D(k\k),  where 


D(k\k)  =  Diag 


B.  Measurement  Update  -  Recovering  the  CF  Structure 


The  old  terms  will  be  discussed  later.  For  the  new  terms,  i.e., 
m  ~  !>■■■>  l~k\k-  the  numerators  of  the  measurement  updated 
coefficients  (65b)  can  be  reduced  to  a  form  compatible  with  Result 
3.  The  four  parameter  bracket  tenns  in  (65b)  can  be  rewritten  as  (56) 
from  the  second  measurement  update.  Manipulate  the  bracket  term 
outside  the  product  in  (65b)  in  the  same  manner  as  (54)  and  (55). 
Then,  the  entire  numerator  form  (65b)  can  be  collapsed,  using  Results 
1  and  2,  into  the  fonn  given  in  (56).  Finally,  the  new  four-parameter 

polynomial  coefficient  involving  two  fundamental  directions  (HA  and 
a  row  of  Bk\k-\)  can  be  computed  as  in  the  second  measurement 

update  using  Result  3.  Those  parameters  will  be  combined  with  other 
terms  with  the  same  exponential  arguments,  the  set  of  new  parameters 
will  be  appended  to  the  bottom  of  G(k\k  —  1)  to  form  a  new. array 
denoted  G(k\k).  Denote  the  number  of  the  new  terms  in  cp  as 

k\k 


^Mr^k+\  .  j 


Ck+Nk  k  )  T 

H  | 

1  1  1 
cfTHT’  HT’  '"’HT  ■ 


Similarly,  the  denominators  of  the  elements  of  the  /,h  row  of  P(k\k) 
are  also  equal.  Denote  the  diagonal  matrix  D(k\k),  whose  elements 
equal  the  absolute  values  of  the  corresponding  elements  in  D(k\k). 

The  new  terms  in  P(k\k)  and  Z{k\k)  are  fonned  in  the  same 
manner  as  in  the  first  and  second  measurement  update,  reducing  sums 
of  //  —/7m  terms  to  constants  times  HA  and  reducing  the  polynomial 
coefficients  to  the  four  parameter  structure  in  (23)  using  the  Results 
in  Appendix  A,  producing  the  new  rows  for  G(k\k).  The  recursion 
for  P(k\k)  is 

Pik\k)= 

Pcfclfc-l) 
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Since  one  of  the  two  fundamental  directions  for  every  new  term 
will  be  HA,  it  is  appended  to  S#c|*— 1  to  obtain 
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Y  |det  O  |  +  /3  HA O  1  A  T 


M 


) 


HA 


(66 

□  □ 


□□ 

; 


The  array  M  (k  \  k)  is  constructed  by  appending  A/^  ^  new  rows  to 
M(k\k  —  1).  The  recursion  is  given  by 
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<-Nk\k-\> 

*-  /._1  -1 


Ar 


=i 


p 

Nk-l\k-l 


—  1 1  At—  1 
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Ar 


nD 


(70) 
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Similarly,  the  recursion  for  Z(k\k)  is 
Z(k\k)  = 


□lD(/c|/c)x 

m  m 

an 
□□ 

Qfl 


Z(k\k  -  l) 


z(k) 

z(k) 


0 
0 

-Z2  •  detO 

(Of  +3)  /f-n/c-i 


\ 


□m  ' 

L  □  z(k)  -Z2 


n  KB^rk+N*-'yHT  -detO 


CL 

-Z(/C) 


=1 


□  □ 

uu 


Z{k) 


-%z, 

=  1 


~Z(k  -  1)  ■  det  O 

Z(Nk-i  ,/f_i)  ^|/tAr 
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era 


SB 


M(Nk 


B„ 


1  “  AT 


BB 


(71) 


Since  all  the  new  rows  in  M(k\k)  have  width  two,  Lk\k  is  measure¬ 


ment  updated  by  appending  N 


k\k  elements  of  the  integer  2  to  Lk\k-i 


-  f 

k\k  -  L 


1 


k\k-l 


(72) 


Thisproducesalloftheparametersnecessaiyto  express  the  uepdf  s 
CF  for  the  kth  measurement  update  as 

Nk\k 

V  .  (v) 

cp  ( 1/ )  =  G' 

/t|/t  i=1  k\k 


□  ,K„ 

X  exp  L  —  ~  kikP  BMiV  +j 
□  1  k\k 

=  1 


O" 


z.bm ^  1/  , 

•  k\k 


=  1 


(73) 


where  coefficients  are  given  by 

Gj<\k(v)  =  a i  +  bi  sgn  v  sgn  (BMe  v 
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k\k 


—2 


k\k 

7  n 


r=l 


yc/sgn  +  yd/ sgn  6^; 

r 

1 

i&i<-r  +  Y  +  S‘k_r(Bk\kV) 


1 

2rf 


7&~ 


where 

S'k-r(Bk\kV) 


u f 


k—r 

( 


/  +  S' — (BkkV) 


(74a) 


/<— r  I 


r  v  M  T  ( 

Pi  BkJr\k_r H  sgn  £%V 

=t 


(74b) 

„/  1  Mj  T  2  Mr  T 

/f-r  =  z(k-  r)~  ZiBk_r,k_r H  —  Z'  e/f-r|/(-/-H  ’  <74c> 


and  /f — r  is  the  time-step.where  the  term  involving  S  k  was  created. 
Hence,  the  age  of  the  fn  term  after  the  /ctn  update  is  L  „  . 

Ar|Ac  —  2,  and 

new  terms  have  age  zero. 


C.  Time  Propagation 

The  general  time  propagation  uses  the  same  formula  as  in  the  first 
time  propagation, 


f3  rT\/ 

1  k+i\k(v)  ~  <Pk\k(®'v)  ■  e 


N.k^k 

<k+uk(y)=  Gk+i\k(v) 


X  exp 
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Pi  B 


=i 


k+\\k 


=  1 


k+l\k 
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(76) 


where  g 

P{k+X]k)  =  QP(k  |c)  PQ  ,  Z(k  +  1  |/c)  -  Z(/c  jo. 
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M(k  +  1  k)  - 

2(*+l)H 

1 

B  ' 

:  □’ 

2  (k  +  1) 

□ra 

pip 

r  1 

Bk\k  ■  (bT 

Lk+\\k 

—  Lk\k  + 

D  D  _ 

B  ’  g/c+1  \k  — 
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rT 
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(77) 

and  both  Nk+\\k 

-  Nk\k  and  Nk+l\k  -  Nk\k  because 

no  new  terms 

are  created  during  the  time 

propagation. 

Z).  Evaluating  the  Conditional  Mean  and  Estimation  Error  Variance 


It  is  shown  in  [12, 13]  that  <pk\k(V)  is  twice  continuously  differen¬ 
tiable.  The  mean  can  be  found  from  the  unnormalized  characteristic 
function  [1]  by  taking  its  partial  derivative  and  then  taking  the  limit 
as  V  goes  to  the  origin. 

(  \t 


k{k)  =  E\x{k)\Zk\=  deputy)  ,  (78a) 

Jf  \Z) 

Zk  k  . 


where 


<(Z)=  cp 


V) 


(78b) 


k\k 


The  second  moment  can  be  found  by  taking  the  same  limit  of  the 

second  partial  derivative  of  the  characteristic  function,  as 
E  x(k)x(k)  -1  d~w  (v) 


\Zk  = 

fzk  ( Zk ) 

and  the  error  variance  is  given  by 

S(/c)  =  E  x(k)x(k)T 

\Zk 


xk\zk 

51/l/T  ^0’ 

%k)%k)Y. 


(78c) 

(78d) 


The  limits  above  must  be  taken  along  valid  directions,  due  to  the 
structure  of  <Pk^k(Y)  in  (76).  Since  this  structure  is  a  special  case  of 
the  structure  in  [12, 13],  the  details  of  these  operations  can  be  found 
there. 


VIII.  Finite  Horizon  Approximation 

In  order  to  arrest  the  growth  in  computational  complexity,  we 
approximate  the  full  information  CF  with  one  using  a  fixed  slid¬ 
ing  window  of  the  most  recent  measurements,  where  number  of 
measurements  in  this  horizon  is  denoted  Nz .  Hence,  the  first  Nz 
measurement  updates  in  the  estimation  are  performed  normally.  Then, 
for  every  measurement  update  k  >  Nz,  wc  initialize  a  new  finite 

horizon  (FH)  estimator  and  perfonn  Nz  measurement  updates  over 
the  fixed  window  {zlk  —  Nz  +  1 Z(k)}.  This  new  initial 

condition  for  the  FH  estimator  is  of  the  form 

k  w  l  tv  2 

<P 

wi\o(V)  =  exp  ~ai  BtviicV  -or2  Byvi\ 


(75) 


w 


r 


+jx 

1  Vl  | 

cos  0  sih  0 

j*2 

V2  , 

The  characteristic  function  of  the  ucpdf  for  the  once  propagated 
conditional  density  can  be  written  as 


(79) 
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where  Bwi  |o  is  a  rotation  matrix.  Denote  the  windowed  first  mea¬ 
surement  update  mean  and  variance  by  ^ (1)  and  The 

III  Hill! 

12  12 

,  and  of  the  updated  FH  first  update  CD 

01  1  k-Nz+i \k-Nz+\  tVl|l 

This  process  involves  solving  five  nonlinear  equations  with  the  five 
unknowns  stated  above,  which  is  carried  out  using  standard  numerical 
tools.  It  is  necessary  to  make  use  of  the  decomposition  in  (5a)  in  order 
to  apply  the  proposed  algorithm  to  the  initial  condition  in  (79). 

This  local  first  measurement  updated  CF,  then,  has  the  same  mean 
and  variance  as  the  original  CF  we  are  approximating.  The  remaining 
Nz  —  1  measurement  updates  are  performed  over  the  measurements 
in  the  window,  ultimately  producing  the  A/z-measurement_updated 
CF  i  wNz\Nz  ■  This  CF  is  taken  as  the  approximation  of  cp  ,  i.e., 
kk.  Flence,  for  k  —  Nz  +  1  <  Nz  the  FF1  initial 

_  ~  ,  which  is  conditioned^&i  the 

W  Nz  |  Nz  ~  V  I  _ 

condition  in  (79)  approximates  cpk ,k 

entire  measurement  history.  Then,  for  k  —  Nz  +  1  >  A/z_ilie  FF1 
initial  condition  approximates  the  mean  and  variance  of  (p  z+l  , 

WNZ\NZ 

produced  by  a  previous  iteration  of  this  process. 

Numerical  comparisons  have  shown  that  the  local  initial  condition 
found  in  this  way  performs  well  in  reproducing  the  full  infonnation 
mean  and  variance.  Moreover,  simulations  have  shown  that  the 
finite  horizon  mean  and  variances  agree  very  closely  with  the  full 
information  case  even  with  horizon  lengths  as  small  as  8,  as  shown 
in  the  next  section. 

IX.  Numerical  Examples 

We  present  a  set  of  four  examples  demonstrating  the  perfonnance 
of  our  proposed  two-state  estimator.  The  main  challenge  in  imple¬ 
menting  this  estimator  is  the  growth,  with  each  measurement,  of  the 
number  of  terms  needed  to  express  the  cpdf  s  CF.  The  proposed  two- 
state  estimator  is  more  efficient  and  produces  far  fewer  terms  than 
the  general-state  estimator  presented  in  [12, 13],  The  improvement  in 
perfonnance  is  quantified  in  the  table  below,  comparing  the  number 
of  terms  in  the  sum  for  a  two-state  implementation  of  [12, 13]  to  the 
number  produced  by  the  proposed  algorithm,  given  by  (68). 


Measurement  Update  k 

8 

10 

12 

Nk\k  of  Previous  [12,  13] 

75036 

1389207 

25719609 

Nk\k  of  Proposed 

3193 

21891 

150049 

Percent  Retained 

4.3% 

1.6% 

0.58% 

However,  the  proposed  estimator  algorithm  still  suffers  from  the  same 
fundamental  issue  of  growing  complexity.  This  motivated  the  use  of  a 
fixed  window  of  the  most  recent  measurements  discussed  in  Section 
VIII,  the  performance  of  which  is  discussed  next. 


A.  Finite  Horizon  Accuracy 

Figure  1  shows,  on  a  logarithmic  scale,  differences  between  the 
elements  of  the  estimated  state  and  error  variance  between  the 
finite-horizon  and  full-information  estimators,  normalized  to  the  full- 
information  values;  denote  this  normalized  difference  of  a  given 
element  as  e(-).  The  system  parameters  used  are  /3  =  0.5,  y  =  0.1, 
Ofi  =  a2  =  0.8,  eig(O)  =  0.8  ±  0.55 j,  H  =  [1  1],  and 

f  ft  [0.5  1]T.  We  compare  the  performance  of  horizon  lengths 

of  Nz  =  8  (dashed  lines)  and  Nz  =  10  (solid  lines).  The  subscripts 
indicate  which  element  of  the  state  estimate  vector  and  error  variance 
matrix  are  being  compared.  These  results  show  that  this  finite  horizon 
approximation  is  very  accurate,  with  errors  approximately  between 
0.01%  and  0.0001%  for  our  example  and  these  two  horizon  lengths. 


9  10  11  12  13  14 

Time  Step 


Fig.  1 :  Comparison  of  two  finite  horizon  estimators  (dashed  is  8,  solid 
is  10)  to  the  full  infonnation  estimator’s  means  and  error  variances. 


B.  Cauchy-Gaussian  Comparison 

In  our  examples  we  compare  the  performance  of  our  Cauchy 
estimator  to  the  Kalman  filter.  To  do  this,  we  need  to  be  able  to 
choose  Gaussian  parameters  for  the  Kalman  filter  that  approximate 
the  Cauchy  parameters.  To  construct  a  normal  or  Gaussian  pdf  that 
best  fits  a  given  Cauchy  pdf,  the  following  optimization  problem  is 
solved  I 

O*  m  argmin  ^  fjX)  -  f^(X')  2  dx  (80) 

a 

_7~CO  5/n 

where  the  Cauchy  pdf  is  i  (X)  =  ,  5  >  0  and  the  normal  pdf 

x  ■  <V' 

is  given  by  f^(X)  -  - — ,  a  >  0.  Solving  (80)  analytically 
leads  to  a  complex  nonlinear  equation  relating  a*  to  5.  Solving  the 
latter  numerically  yields  O*  =  k$5,  kg  ~  1.38980. 

C.  Simulations 

The  simulations  in  Figs  2,  3,  and  4  all  use  the  same  dynamics, 
where  the  eigenvalues  of  the  transition  matrix  are  eig(d>)  =  0.8  ± 

0.55 j,  H  =  [1  1],  and  T  =  [0.5  1]T.  The  initial  condition  has  a 
zero  median  and  Cfi  =  Cf2  —  0.8.  All  simulations  use  a  measurement 
horizon  length  of  Nz  -  10.  In  Fig.  2,  y  =  0.5  and  |8  =  0.1  so  that 
the  measurement  noise  dominates  the  process  noise;  in  Fig.  3  the 
parameters  are  interchanged  so  that  the  process  noise  dominates  the 
measurement  noise.  Gaussian  parameters  used  for  the  LEG  and  for 
Gaussian  noises  are  closest,  in  the  Lo  sense,  to  their  corresponding 
Cauchy  distributions.  For  clarity  of  presentation  in  the  figures,  the 
first  update  occurs  at  k  —  0  instead  of  k  =  1. 

Figures  2  and  3  compare  the  Cauchy  and  Kalman  filters’  responses 
to  Cauchy  distributed  noises,  and  Fig.  4  compares  their  response 
to  Gaussian  distributed  noises.  Figures  2b  and  3b  show  the  same 
data  as  Figs.  2a  and  3a  when  zoomed  in  around  zero  to  demonstrate 
more  clearly  how  the  controllers  respond  when  noise  impulses  are 
encountered. 

When  facing  Cauchy  distributed  noises  the  proposed  estimator 
outperforms  the  Kalman  filter,  especially  when  y  >  \ 8  as  in  Fig. 
2.  In  this  case,  the  Kalman  filter’s  estimation  error  is  almost  always 
larger  than  that  of  the  Cauchy  estimator.  Moreover,  the  impulsive 
noise  values  cause  the  conditional  variance  computed  by  the  Kalman 
filter  to  be  orders  of  magnitude  smaller  than  the  exact  conditional 
variance  computed  by  the  Cauchy  estimator. 

In  Fig.  3,  where  /3  >  y,  both  the  Cauchy  estimator’s  and  the 
Kalman  filter’s  estimation  errors  appear  to  have  similar  performance. 
However,  the  exact  values  of  the  conditional  error  variance  computed 
by  the  Cauchy  estimator  are  quite  different  from  the  error  variance 
computed  by  the  Kalman  filter.  In  contrast  to  Fig.  2,  in  the  case 
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where  /3  >  /  the  Kalman  filter’s  estimation  error  decays  faster  due 
to  the  larger  Kalman  filter  gain,  since  the  process  noise  parameter 
dominates  the  measurement  noise.  This  faster  decay  leads  to  the 
similar  performance  exhibited  by  both  estimators. 

During  periods  in  the  simulation  without  large  impulses,  the 
Cauchy  and  Kalman  filters  have  similar  performance,  as  shown  in 
Figs.  2b  and  3b.  This  suggests  that  in  a  non-impulsive  noise  setting, 

the  two  estimators  would  have  similar  performance.  In  a  Gaussian 
noise  setting,  shown  in  Fig.  4,  the  Cauchy  filter  performs  very  well.  It 
approximates  the  variance  of  the  optimal  Kalman  filter  and  tracks  its 
mean  very  closely.  This  demonstrates  the  robustness  of  the  Cauchy 

estimator  in  a  Gaussian  noise  environment. 

X.  Conclusions 

An  efficient  two-state  Cauchy  estimation  algorithm  able  to  operate 
over  an  indefinite  number  of  measurement  updates  is  derived  and 
presented.  Although  the  estimator  here  is  a  special  case  of  the  general 
Cauchy  estimation  framework  in  [12, 13],  the  structures  presented  in 

this  work  take  advantage  of  relationships  currently  understood  only 
for  the  two-state  system,  allowing  the  development  of  an  efficient 
recursive  estimation  structure.  A  method  for  using  a  finite  window  of 
measurement  is  proposed,  and  example  simulations  are  presented  to 

demonstrate  the  performance  and  robustness  of  the  Cauchy  estimator 
over  a  large  number  of  measurement  updates. 

Appendix 

Result  1:  The  product  of  two  terns  given  by 
{ am  +  jdm  ■  sgn  (HAV)} 

x'r 

jQ  +  y  +  P  1  sgn  (HAv)  jQ  —  /  +  Q  -  sgn  (HAy) — 

=  aom  +Jdmo  ■  sgn  (HAy)  (81a) 


and 

a  =  —2 /  p^  +  D2  -  Y2  ~  S2  , 
c  =  AGypm, 


Jm 


D2  -y  -  8 


„2  2 


b  =_-AypmD, 
d  =  4  0/D, 


f=4(pmD)-  Pm  +  D"  ~Y_ 
A  =  n  ■  e2  -  f2 
Result  3:  The  term  given  by 


4  (PmDy 
AA(PmG')  '  4  (QD)  , 
3  +8  Q2pmD, 
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j8  +  Pm  —  Y  ■  sgn  Bk.'  y  +  D  ■  sgn  (HA l/) 
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(82c) 


-sgn  (HAy) 


jG  ~  Pm  -  y  ■  Sgn  B  V  +  D  ■  sgn  (HA  I/) 
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am  +  bm  ■  sgn  kBm>  y  sgn  (HA I/) 

A  jdm  ■  sgn  (HA\/)  (83a) 


(  k\k 
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where  the  /  subscript  denotes  the  input  term  and  the  O  subscript 
denotes  the  outpu(term, 

1  Mx  e-  -f  0  0^  na  b  c-^n 
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(83b) 


(83c) 


where  the  /  subscript  denotes  the  input  term  and  the  O  subscript 
denotes  the  output  term, 
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(81b) 


and 

a  =  -2 /_■  D2  -  y2  —  02  , 

d  =  40/D, 

e  =  D2  —  y2  —  G2  2  +  4  (QD)2 


(81c) 


Result  2:  The  product  of  two  terns  given  by 

{am  +  bm  ■  sgn  (HAV)  +  jcm  +  jdm  ■  sgn  (HAv)> 

/  7  7 

i 

X  _ 

jQ  +  Y  +  pm  +  D  ■  Sgn  (HAv) 

1 

jQ  -  Y  +  pm  +  D  ■  sgn  (HAv) 

=  am  +  bm  ■  sgn  (HAV)  +  jcm  +  yd  ■  sgn  (HAV)  ,  (82a) 

o  o  o  o 


where  the  /  subscript  denotes  the  input  term  and  the  O  subscript 
denotes  the  output  term, 
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e  =  a2  +  d2  +  c2  +  d2,  f  —2  ab  +  cd  , 

A  =  tt  ■  e2  -  f2  . 

References 

[1]  J.  L.  Speyer  and  W.  H.  Chung,  Stochastic  Processes,  Estimation,  and 
Control.  SIAM,  2008. 

[2]  jsl.  Is|.  Taj|b,  The^^cjck  Swan:  The  Impact  of  the  Highly  Improbable. 

[3]  E.  E.  Kuruoglu,  W.  J.  Fitzgerald,  and  P.  J.  W.  Rayner,  “Near  optimal 
detection  of  signals  in  impulsive  noise  modeled  with  asymmetric  alpha- 
stable  distribution,”  IEEE  Communications  Letters ,  vol.  2,  no.  10,  pp. 
282-284,  Oct.  1998. 

[4]  P.  Reeves,  “A  non-gaussian  turbulence  simulation,”  Air  Force  Flight 
Dynamics  Laboratory,  Tech.  Rep.  AFFDL-TR-69-67,  1969. 

[5]  G.  Samorodnitsky  and  M.  S.  Taqqu,  Stable  Non-Gassian  Random  Pro¬ 
cesses:  Stochastic  Models  with  Infinite  Variance.  New  York:  Chapman 
&  Hall,  1994. 

[6]  P.  Tsakalides  and  C.  L.  Nikias,  Deviation  from  Normality  in  Statistical 
Signal  Processing:  Parameter  Estimation  with  Alpha-Stable  Distribu¬ 
tions;  in  A  Practical  Guide  to  Heavy  Tails:  Statistical  Techniques  and 
Applications.  Birkhauser,  1998. 

[7]  G.  A.  Tsihrintzis,  Statistical  Modeling  and  Receiver  Design  for  Multi- 
User  Communication  Networks;  in  A  Practical  Guide  to  Heavy  Tails: 
Statistical  Techniques  and  Applications.  Birkhauser,  1998. 

[8]  G.  A.  Hewer,  R.  D.  Martin,  and  J.  Zeh,  “Robust  preprocessing  for  kalman 
filtering  of  glint  noise,”  IEEE  Transactions  on  Aerospace  and  Electronic 
Systems ,  vol.  AES-23,  no.  1,  pp.  120-128,  January  1987. 

[9]  J.  P.  Nolan,  J.  G.  Gonzalez,  and  R.  C.  Nunez,  “Stable  filters:  A  robust 

noise”  in  In  Proceedings 

[10]  M.  Idan  and  J.  L.  Speyer,  “Cauchy  estimation  for  linear  scalar  systems,” 

—f  e 


o_  d  o  a 

d  c  a 
b 


IEEE 
Tmnsacti 
ons  on 
Automati 
c 

Control , 
vol.  55, 
no.  6,  pp. 
1329- 
1342, 


di 


(82b) 


2010. 


15 


[11]  - ,  “State  estimation  for  linear  scalar  dynamic  systems  with  additive 

cauchy  noises:  Characteristic  function  approach,”  SIAM  J.  Control 

Optim.,  vol.  50,  no.  4,  pp.  1971-1994,  2012. 

[12]  - ,  “Multivariate  Cauchy  Estimator  with  Scalar  Measurement  and 

Process  Noises,”  in  Proceedings  of  the  52nd  IEEE  Conference  on 
Decision  and  Control ,  Florence,  Italy,  December  2013. 

[13]  - ,  “Multivariate  cauchy  estimator  with  scalar  measurement  and 

process  noises,”  Submitted  to  SIAM ,  2013. 

[14]  - ,  “Characteristic  Function  Approach  for  Estimation  of  Scalar  Sys¬ 

tems  with  Cauchy  Noises,”  in  AIAA  Guidance,  Navigation,  and  Control 
Conference,  Toronto,  Ontario  Canada,  Aug.  2010. 

[15]  D.  Champeney,  A  Handbook  of  Fourier  Theorems.  New  York:  Cam¬ 
bridge  University  Press,  1987. 

[16]  C.-T.  Chen,  Linear  System  Theory  and  Design,  3rd  ed.  New  York: 
Oxford  University  Press,  1999. 


Q_ 


l“ 

~w 

1 

v  fv 

A 

/v  /V 

/V 

-  A 

V 

l  / 

1/ 

y 

0  20  40  60  80  100 

Time  Steps  ( k ) 

(a)  Full  view. 


Fig.  2:  Cauchy  and  Kalman  estimators  for  y  >  /3;  thick  lines  are  the 
estimate  errors,  and  thin  lines  are  the  standard  deviations. 
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Fig.  4:  Cauchy  and  Kalman  estimators  against  Gaussian  noises;  thick 
lines  are  the  estimate  errors,  and  thin  lines  are  the  standard  deviations. 
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A  stochastic  control  scheme  is  developed  for  scalar,  discrete-time,  and  linear-dynamic  systems  driven 
by  Cauchy  distributed  process  and  measurement  noises.  When  addressing  the  optimal  control  problem 
for  such  systems,  the  standard  quadratic  cost  criteria  cannot  be  used.  In  this  study  we  introduce  a  new 
objective  function  that  is  functionally  similar  to  the  Cauchy  probability  density  function.  The  performance 
index,  defined  as  the  expectation  of  this  objective  function  with  respect  to  the  Cauchy  densities,  exists. 
The  dynamic  programming  solution  to  the  fixed  and  finite  horizon  optimal  control  problem  that  uses 
this  performance  index  appears  to  be  intractable.  Therefore,  a  moving  horizon  optimal  model  predictive 
control  problem  is  implemented,  for  which  the  conditional  expected  value  of  the  objective  function  and 
its  gradients  can  be  computed  in  closed  form  and  without  assumptions  such  as  certainty  equivalence. 
Numerical  results  are  shown  for  this  m-step  model  predictive  optimal  controller  and  compared  to  a 
similar,  Linear-Exponential-Gaussian  model  predictive  controller.  An  essential  difference  between  the 
Cauchy  and  Gaussian  controllers  when  applied  to  a  system  with  Cauchy  noises  is  that,  while  the  Gaussian 
controller  is  linear  and  reacts  strongly  to  all  noise  pulses,  the  Cauchy  controller  can  differentiate  between 
measurement  and  process  noise  pulses  by  ignoring  the  former  while  responding  to  the  latter.  This 
property  of  the  Cauchy  controller  occurs  when  an  impulsive  measurement  noise  is  more  likely  than  an 
impulsive  process  noise.  The  Cauchy  and  Gaussian  controllers  react  similarly  when  applied  to  a  system 
with  Gaussian  noises,  demonstrating  the  robustness  of  the  proposed  control  scheme. 

©2013  Published  by  Elsevier  Ltd 
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1.  Introduction 

Modern  stochastic  optimal  control  algorithms,  such  as  the 
Linear-Quadratic-Gaussian  (LQG)  and  B.inear-Exponential-Gaus- 
sian  (LEG)  algorithms,  assume  that  the  system  is  driven  by  addi¬ 
tive  Gaussian  process  and  measurement  noises  (Speyer  &  Chung, 
2008,  Chapters  9  and  10).  Since  the  Gaussian  probability  density 
function  (pdf)  is  a  light-tailed  pdf,  which  essentially  rules  out  the 
possibility  of  large  deviations,  these  algorithms  are  unable  to  han¬ 
dle  measurement  outliers  that  produce  large  filter  residuals  due  to 
impulsive  changes  in  the  measurements.  For  example,  the  govern¬ 
ing  types  of  noises  that  occur  in  radar  and  sonar  applications  are 
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atmospheric  and  underwater  acoustic  noises.  Noises  of  this  type 
exhibit  very  impulsive  behaviors  that  are  not  captured  by  a  Gaus¬ 
sian  distribution  (Kuruoglu,  Fitzgerald,  &  Rayner,  1998).  Dynamic 
impulsive  noises  can  be  used  in  modeling  unknown  adversarial 
motion,  as  well  as  for  modeling  airturbulence,  which  was  shown  to 
be  better  represented  by  non-Gaussian,  heavy-tailed  distributions 
(Reeves,  1969).  The  literature  on  how  to  handle  outliers  is  dom¬ 
inated  by  heuristic  methods  that  assume  a  Gaussian  distribution 
for  the  underlying  stochastic  processes.  Moreover,  these  methods 
typically  work  for  a  posteriori  analysis  of  static  problems,  which 
is  inadequate  for  control  (Fernholz,  Morgenthaler,  &  Tukey,  2004; 
Hampel,  Ronchetti,  Rousseeuw,  &  Stahel,  1968;  Holland  &  Welsch, 
1977;  Pirinen,  2008). 

In  this  paper,  the  proposed  controller  is  based  on  a  discrete 
time  linear  dynamic  system  with  Cauchy  distributed  process  and 
measurement  noises  and  initial  condition.  The  Cauchy  probabil¬ 
ity  distribution  function  (pdf)  is  in  the  class  of  probability  distri¬ 
butions  called  symmetricalpha-stable  (SO-S )  distributions,  whose 
members  are  described  using  their  characteristic  functions  (see 
Samorodnitsky  &  Taqqu,  1994  for  a  comprehensive  treatment  of 
SO-S  densities).  Within  this  class:  the  Gaussian  pdf  corresponds  to 
O  =  2;  O  =  1  leads  to  the  Cauchy  pdf;  all  pdfs  with  O  <  2  have 
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an  infinite  variance,  and  the  Cauchy  pdf  lacks  a  defined  mean  or 
first  moment.  However,  for  Cauchy  uncertainties  the  conditional 
mean  of  the  state  and  its  conditional  variance  given  a  measure¬ 
ment  history  do  exist  (Idan  &  Speyer,  2008,  2010).  An  important 
aspect  as  well  is  that  the  conditional  error  variance  is  a  function  of 
the  measurement  history  (Idan  &  Speyer,  2008, 2010),  quite  unlike 
the  Gaussian  case  where  the  conditional  error  variance  is  a  priori 
known. 

One  of  the  fundamental  lessons  learned  from  the  results  pre¬ 
sented  in  this  paper  is  that  the  handling  of  measurement  outliers 
directly  involves  the  stochastic  controller.  A  stochastic  control  law 
has  to  be  consistent  with  the  underlying  structure  of  the  condi¬ 
tional  pdf  of  the  state  variable  and  with  the  assumptions  used  in 
its  construction.  Under  the  Gaussian  assumption,  only  the  state  es¬ 
timate  depends  on  the  measurement  while  the  error  variance  of 
the  estimate  does  not.  Since  optimal  controllers  based  on  Gaus¬ 
sian  noises  (e.g.  LQG  and  LEG  controllers)  are  linear  in  the  mea¬ 
surements,  they  do  not  differentiate  between  outliers  and  normal 
measurements  and  respond  to  both  of  them  in  the  same  manner. 

Since  stochastic  optimal  control  algorithms  are  developed 
based  on  the  minimization  or  maximization  of  an  expectation  of  an 
objective  function,  an  appropriate  computable  objective  function 
hasto  be  determined  for  systems  with  Cauchy  noises.  The  objective 
functions  normally  used  in  the  LQG  and  the  positive  LEG  (by 
which  we  mean  that  the  quadratic  argument  of  the  exponential 
is  positive)  control  settings  are  not  suitable  here  because  those 
expectations  are  either  undefined  or  infinite  for  Cauchy  pdfs.3 
Hence,  a  different  objective  function  has  to  be  chosen  for  this  case. 
We  chose  an  objective  function  that  is  a  product  of  functions  that 
resemble  scaled  Cauchy  pdf’s  in  structure  and  depend  on  the  state 
or  control  variable.  For  Cauchy  uncertainty,  the  expectation  of  this 
objective  function  is  finite  and  its  conditional  expectation  given  the 
measurement  history  can  be  expressed  as  a  closed  form  function 
of  the  measurements  and  controls.  A  similar  performance  index 
formulation  was  made  for  the  LEG  problem,  where  the  objective 
function  was  constructed  as  a  product  of  exponential  functions 
of  the  state  and  the  control  (Jacobson,  1 973)  resembling  Gaussian 
pdf’s  in  form. 

Based  on  the  proposed  objective  function,  a  dynamic  program¬ 
ming  recursion  rule  is  developed  to  construct  the  optimal  con¬ 
trol  function.  However,  the  application  of  this  recursion  rule  to 
the  finite  fixed  horizon  optimal  control  problem  for  systems  with 
Cauchy  distributed  noises  appears  to  be  intractable.  Therefore,  we 
instead  develop  an  m-step  model  predictive  optimal  controller 
(Morari  &  Zafiriou,  1989),  i.e.,  open-loop-optimal  feedback  con¬ 
troller,4  for  such  systems.  Some  initial  results  on  scalar  Cauchy 
model  predictive  control,  the  Cauchy  controller,  were  presented 
in  Idan,  Emadzadeh,  and  Speyer  (2010)  and  Speyer,  Idan,  and 
Fernandez  (2010).  The  current  work  presents  a  more  complete 
scalar  Cauchy  m-step  optimal  model  predictive  solution.  Although 
formulating  an  m-step  optimal  predictive  controller  produces  a 
suboptimal  solution,  the  performance  index  is  evaluated  by  de¬ 
termining  the  conditional  expected  value  of  the  objective  function 
given  the  measurement  history  exactly  in  closed  form.  This  Cauchy 
controller  is  expressed  as  a  nonlinearfunction  of  the  measurement 
history. 


The  in-phase  component  of  radar  clutter  time  series  agrees  extremely  well  with 


The  scalar  state  problem  addressed  in  this  work  provides  in¬ 
sight  into  the  problem  of  handling  outliers,  which  is  resolved  by  the 
Cauchy  stochasticcontrollerexplicitly,  and  not  in  thefilteralone,  as 
has  been  traditionally  done  for  Gaussian  noises.  There  is  no  equiv¬ 
alent  construction  of  such  a  Cauchy  controller  in  the  non-Gaussian 
stochastic  control  literature,  including  in  the  stochastic  model  pre¬ 
dictive  control  (MPC)  setting.  In  stochastic  MPC,  the  objective  is 
to  minimize  a  quadratic  objective  function  subject  to  probabilis¬ 
tic  inequality  constraints.  In  Jun  and  Bitmead  (2005)  the  assump¬ 
tion  of  a  linear  system  with  additive  Gaussian  noise  is  essential  in 
transforming  the  stochastic  optimal  control  problem  into  a  deter¬ 
ministic  one.5  In  Cannon,  Cheng,  and  Rakovic  (2012)  the  Gaussian 
assumption  is  removed  by  assuming  that  all  stochastic  uncer¬ 
tainties  have  bounded  support.  Our  motivation  is  to  determine  a 
stochastic  controller  based  on  Cauchy  pdf’s  with  infinite  support, 
which  produces  a  deterministic  analytic  performance  index  in  the 
control  and  measurement  history. 

The  Cauchy  controller  gives  insight  into  robustness  by  intro¬ 
ducing  heavy  tailed  distributions  into  the  design.  Moreover,  a  con¬ 
troller  design  process  is  advocated  for  handling  outliers  based  on 
whether  the  measurement  noise  dominates  the  process  noise.  If 
measurement  noise  dominates,  the  measurement  outliers  do  not 
have  to  be  known,  since  the  controller  responds  little  to  them. 

The  paper  is  organized  as  follows.  The  optimal  control  prob¬ 
lem  is  formulated  in  Section  2.  A  new  objective  function  is  intro¬ 
duced  in  Section  3  having  the  functional  form  of  the  Cauchy  pdf. 
The  performance  index  to  be  optimized  is  the  conditional  expecta¬ 
tion  of  the  objective  function.  A  dynamic  programming  recursion 
rule  is  derived  in  Section  4.  Since  the  solution  to  the  dynamic  pro¬ 
gramming  problem  appears  intractable,  the  Cauchy  m-step  model 
predictive  optimal  controller  is  formulated  and  developed  in  Sec¬ 
tion  5.  The  conditional  expected  value  of  the  objective  function  is 
determined  in  closed  form.  However,  due  to  its  complexity,  the 
maximization  oftheconditional  performance  indexwithrespectto 
the  projected  control  sequence  is  determined  numerically,  as  de¬ 
tailed  at  the  end  of  Section  5.  Numerical  examples  are  presented 
in  Section  6.  First,  a  one-step  process  with  a  single  measurement  is 
first  explored  to  gain  insight  into  the  structure  and  behavior  of  the 
Cauchy  controller  in  the  presence  of  impulsive  heavy-tailed  noises 
discussed  in  Section  6.1 .  Then,  multi-step  examples  are  given  in 
Section  6.2  comparing  the  performance  of  the  Cauchy  and  Gaussian 
m-step  model  predictive  optimal  controllers  under  both  Cauchy 
and  Gaussian  noises.  Finally,  concluding  remarks  are  presented  in 
Section  7. 

2.  Problem  statement 

Considerthe  linear,  discrete-time,  scalarstochasticsystem 

Xfc  =  0xk-i  +  Ufc— i  +  Wk-1,  zk  =  Hxk  +  \k,  (1) 

where  xa  is  the  state,  ua  is  the  control  signal ,  z;t  is  the  measurement, 
and  k  is  the  time  index.  The  signals  wk  and  vk  are  independent 
process  and  measurement  noise  sequences,  respectively,  that  are 
assumed  to  be  independent  of  each  other  and  Cauchy  distributed 
with  pdf's 


(2) 


a  SO-S  pdf  with  (7  =  1.7  (Tsakalides  &  Nikias,  1998). 
3 


A 


In  this  paper,  we  will  compare  our  Cauchy  controller  to  the  negative  LEG 
controller. 

^  Open-loop-optimal  feedback  was  introduced  in  the  stochastic  control  context 
by  Dreyfus  in  Dreyfus  (1965) 


P/h  f  ,  _  y/n 

fSw<)  =  ~ FP  h(Vk)  ~ 
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— TZ - 

Moreover,  the  solution  in  Jun  and  Bitmead  (2005)  is  exact  only  for  the  scalar 
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These  pdf's  have  a  median  of  zero  and  scale  parameters  /3  >  0  and 
JfisfritfctiSf  Pect'vely-  The  initial  state  is  assumed  to  be  also  Cauchy 


a/n 


fx0(xa)  —  .  o 

(* 


-  Xo)2 


+  a2 


with  a  median  at  x  o  and  scale  parameters  O  >  0. 
Define  the  measurement  history  as 

Zk  ■  {zo,  *  •  •  /  Z k}/ 


(3) 


(4) 


and  assume  that  €  Ft,  where  F/t  is  the  class  of  functions  adapted 
to  the  O -algebra  Ok  generated  by  Zk.  The  notion  of  adaptation 
is  that  the  random  variable,  here  the  control,  is  measurable  with 
respect  to  events  in  Uk. 

In  this  work  we  consider  a  regulation  problem,  where  the  state 
is  to  be  driven  to  zero  while  penalizing  the  control  effort.  This  prob¬ 
lem  will  be  cast  as  an  optimal  control  problem,  the  performance 
index  of  which  will  be  introduced  next. 


function,  discussed  later,  as 


A 


Pi 


(//  X" 

1+1/  =  Mx(xi+i)Mu(ui) 


d 


2  7 

f],  + 1  ■  £>2 

—2 -  2  -2-? - i 


□ 


(6) 

i  =  t  Xi+1  4"  Hi  + 1  U;  + 

with  i  <  n.  The  state  and  control  histories  used  in  the  objective 
function  are  defined  as 

X{  :=  {Xf/ . .  .,x„},  (7a) 

Ut  :=  {uf, . . . ,  u„}.  (7b) 

Since  the  performance  index  is  composed  of  the  expected  value  of 
bell-shaped  curves  in  (6),  we  maximize  to  obtain  regulation  of  the 
state  with  penalties  on  the  control.  The  performance  index,  which 
is  the  expectation  of  the  objective  function  in  (6),  is  given  by 


)(  ,,  =  max£  ip 


f+i 


(8) 


3.  Performance  index  for  a  Cauchy  controller 

In  posing  an  optimal  control  problem  for  the  model  in  (1), 
commonly  used  objective  functions  like  the  quadratic  or  the 
exponentialofaquadraticcannotbeusedbecausetheexpectations 
required  to  evaluate  those  objective  functions  are  infinite  when 
the  system  noise  inputs  have  heavy-tailed  Cauchy  pdf’s.  Therefore, 
one  has  to  introduce  a  new,  computable  objective  function.  In  this 
work  we  suggest  an  objective  function  that  resembles  in  its  form 
the  Cauchy  pdf,  and  which  also  allows  an  analytical  derivation 
of  the  controller.  This  general  objective  function  is  reminiscent 
to  the  choice  of  the  objective  function  for  the  LEG  (Speyer  & 
Chung,  2008,  Chapter  10),  which  was  constructed  as  a  product 
of  functions  resembling  the  Gaussian  pdf.  The  original  motivation 
for  the  LEG  objective  function  (Jacobson,  1973)  was  to  consider 
these  exponential  functions  as  membership  functions  in  fuzzy  set 
theory,  where  the  objective  function  was  constructed  as  a  product 
of  these  Gaussian-shaped  membership  functions.  Similarly,  a 
objective  function  constructed  from  products  of  Cauchy-shaped 
membership  functions  is  proposed  here  for  systems  with  Cauchy 
noises.  This  similarity  allows  us  to  compare  the  optimal  controllers 
for  linear  systems  with  analogous  performance  index  and  different 
types  of  noise. 

Consequently,  the  membership  functions  that  penalize  the  state 
and  control  are  chosen  as  rational  functions  resembling  Cauchy 
pdf’s  and  are  expressed  as 

A 


My(Xk)  = 


nl 


M„(uk)  = 


L  A 


(5) 


Xk  +  Ik  ut  +  4 

where  r/k  and  <+,  are  design  parameters  which  affect  the  shape  of 

these  functions.  Note  that  as  r)k  — •  00  or  £k  —  00,  the  penalties 
on  the  state  or  the  control  are  removed.  In  addition,  smaller 
values  of  r/k  and  £k  induce  heavier  weighings  on  the  respective 
variables.  These  particular  functions  are  chosen  because  they  make 
the  expectation  with  respect  to  the  conditional  pdf  generated 
by  Cauchy  noise  of  the  resulting  performance  index  analytic  in 
the  control  and  measurement  history  as  will  be  shown  in  the 
sequel. 

Ideally,  for  regulation,  we  would  like  to  solve  an  infinite  horizon 
regulation  problem.  This  is  often  done  by  solving  a  fixed  horizon 
problem  from  k  =  0  to  N,  e.g.,  using  the  dynamic  programming 
approach,  and  then  letting  N  go  to  infinity.  However,  we  define  a 
more  general  objective  function  that  will  serve  both  the  dynamic 
programming  formulation  and  the  model  predictive  objective 


where  element  uk  in  U"  1 ,  the  control  sequence,  is  €  Ft.  The 
expectation  is  taken  over  all  the  underlying  random  variables,  i.  e., 
all  the  measurement  and  process  noises,  and  the  initial  conditions. 


4.  Dynamic  programming  approach 

To  solve  the  problem  in  (8),  we  first  consider  the  dynamic 
programming  approach  with  the  hope  of  constructing  an  analytic 
form  of  the  controller.  It  requires  the  derivation  of  the  general 
optimal  return  function  and  the  backwards  recursion  rule,  which 
are  derived  next.  This  derivation  shows  the  complexity  of 
evaluating  the  backwards  recursion  rule  for  Cauchy  noise  patterns 
and  motivates  the  derivation  of  an  alternative  model-predictive- 
type  controller. 

In  this  derivation,  we  consider  a  given  fixed  horizon  problem 
by  setting  i  and  n  of  (8)  to  be  l  =  0  and  n  =  N.  Expanding  the 
expectation  cHi  tBi  right-hanEfiide  of  (8)  results  in 

o, n  =  max  £  |l  Xt  /  U0 

m 

1  0  D 


N-1 

U  0 


maxE 

,.N- 1 


N 

N  N- 1 


=  max  £  £  ...£  E  ip  X17  U0  ^ZNUZN_,  .  .  .Dz0 

U°D  □ 

□  □  □  „ 

max  E  E  ip  x 

t=c  '  / 

m 


=  E  max  E 

uoeFo  uN_1eFN_1 

v-iO  I  7  □  I  z  D  ,7 

ZN  I  Z-N-1  ■  ■  ■  Zo 


N- 

u  0 


□  □  □  00 

I— I  OO 

max 

—  oo  uqGFo  -00 


□ 


max 

uN-1GFN-1  _c 


Uq  ^  I ZN  An\ aN-i  (ZN  I  ZN- 1) 

□ 

’  fz  1 1 oq  ( Zi  |  ~Z- 0 )  dz 1  fzQ  (zo)dzo, 


) 

□ 


(9) 


where  element  us-  in  U"  1  is  ut  €  F/,,  and  fz  a  (zk \Zk  1)  is  the 

0  kilc-"]  ~ 

probability  density  function  conditioned  on  the  <7t  -i  generated  by 
the  measurement  histories  up  to  k  —  1 .  The  Fundamental  Lemma 
(Speyer  &  Chung,  2008,  Chapter  9)  was  used  to  interchange  the 
maximization  and  expectation  operations  in  the  third  equality 
of  (9). 
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To  apply  dynamic  programming  to  the  problem  in  (9),  we  need 
to  recast  the  performance  index  as  a  recursion  rule  for  an  optimal 
return  function.  To  get  this  recursion,  first  note  that  the  optimal 
performance  index  starting  at  time  step  k  +  1 ,  which  is  embedded 
in  (9), 
maxE 

i  o 


u  "  eD„4<- 


J.L.  Speyer et  al.  /  Automatica  xx  (xxxx)  xxx—xxx 

5.  Cauchy  optimal  model  predictive  controller 


□  oo  □  LJ 

max 

max 

u,  ,  1  FR  ,  H  uN-1eFN-1 

u7:+1tr7c+1  —  oo 

□ 

ur1  i 

Zn  fz  nI°N-1  (zN  | Za/  — 1 

'  fzk+2 

|Ojc+i  (zk+ 2  |Zfc+1  )  dZk+2‘ 

(10) 


Using  the  above,  the  performance  index  (9)  can  be  restated  as 

□  □»  D 


□ 


□  D  □  nN 

u  00  max  E  UJ  XN 


I(tN  -  max  ■  ■  ■  imcia  1 1  ioa  u  y  ,x  i 

'  '  N  1  1 

n-oo  uopfo  oo  uk(EFk  -00  ,J- 

U"-1  |Z *  /  (zk  1  |Z  ) 

o  +i  zk+i\(Jk  Qf  *[]  _ 


■  fii\o0  fa  |ZoJ/z0  (zo)  rfzi'+i  ■  ■  ■  dz  1  dzo- 


(11) 


Pro9ramrning  solution,  define  the  opti- 

"  ,  □  □  m  □ 

/c+i,n  (Zfc+v  t!.  max  E  ip  X1 ,  Uq  Z+i 
7*  N  N-1  Q 


^/2t+ll0k  (zk+ 1  \zk)  ■  ■  '/Z1|0o  fa  \Z0)f,,  (zo) , 


(12) 


which  is  the  integrand  of  the  most  inner  integral  in  (1 1).  Then,  the 
sought  after  dynamic  programming  recursion  rule  becomes 

□  =0  _ 

1\,n  (z>)  =  max  jl+,'N(Zk+i)dzk+i.  (13) 

This  is  the  general  form  for  the  recursion  rule  from  which  the  op¬ 
timal  performance  index  is  determined  by 

r*  ^ 

0,N  =  Jo,N  (Zo)  dz.Q.  (14) 


A  method  for  obtaining  the  conditional  pdfs  needed  to  evaluate 
the  expectations  in  (12)  is  presented  in  Idan  and  Speyer  (201 0)  and 
reviewed  in  Appendix  A.  It  can  be  verified  that  the  functional  de¬ 
pendence  of  these  conditional  pdfs  on  the  measurements  is  very 
complex,  making  the  integration  and  maximization  required  in 
(13)  intractable.6  The  main  difficulty  arises  from  the  need  to  av¬ 
erage  over  future  measurements. 

Alternatively,  a  model  predictive  controller  that  does  not  av¬ 
erage  over  future  measurements  and  can  therefore  be  constructed 
more  feasibly.  Although  sub-optimal,  it  will  produce  a  practical  and 
computable  control  solution  for  a  system  with  Cauchy  noises.  This 
paper  considers  an  m-step  horizon  optimal  model  predictive  con¬ 
troller,  which  is  developed  in  the  next  section. 


D  Note  that  for  a  Gaussian  system  with  a  multiplicative  objective  function 
constructed  from  the  product  of  exponentials  with  quadratic  arguments,  (13) 
produces  a  linear  controller  —  the  LEG  controller  (Speyer  &  Chung,  2008,  Chapter 
10).  However,  if  the  objective  function  is  constructed  from  a  sum  of  exponentials 
with  quadratic  arguments,  the  solution  to  (13)  appears  intractable. 


The  Cauchy  m-step  optimal  model  predictive  controller  is 
determined  by  maximizing  a  moving,  fixed-horizon  performance 

index  that  is  a  function  of  all  the  measurements  up  to  the  current 
time  but  no  future  ones.  The  performance  index  is  defined  as  the 

expected  value  of  the  objective  function  in  (6)  using  l  =  k( i.e., 
the  current  time)  and  n  =  k  +  m  t:.  p,  where  m  is  the  size  of  a 
moving  horizon.  The  expectation  is  carried  out  overthe  stochastic 
variables  associated  with  the  states  in  this  moving  horizon,  i.e., 
the  future  process  noise,  and  the  current  measurement  history.  As 
opposed  to  dynamic  programming,  here  we  define  the  predictive 
control  sequence  as  U'  ,  where  every  element  in  it  is  in  F  ; 

k  k 

i.e.  u(  e  Ft  V  t  =  k, ...  ,p  —  1.  Hence,  the  performance  index 
is  given  as 

□  □ 

74  =  max  E  ip  X{+uUk  ,  (15) 

u  k~ 

where  £  [  ■  1  is  the  expectation  .taken  over  Zk  and  Xp ...  it  is 
„  f ' 7  of  6)  is  boundedlb/oTie for 

M^n^  C^te  M^rmance  index  /*,,,  =  n 


ur1 


m  c1  a  v/ 

.  k  of  (15)  always  exists  and  is  finite. 

The  optimal  performance  index  of  (1 5)  is  restated  as 

/*  d  □  □  Ufa  m 

**  =  mfai£  E  v  <M/ur 


k+ 1 
V 


UtD  □  □ 

=  E  maxE  ip 

.  .r-1 


I  IP-1 

X?+1'  Ut 


□ 

mo  □ 

t:.E  I*  , 

k  Zk 


(16) 


where  the  Fundamental  Lemma  (Speyer  &  Chung,  2008,  Chapter 
9)  is  used  to  interchange  the  maximization  and  expectation 
operations.  The  current  control  is  found  as  a  function  of 
the  current  information  pattern  that  includes  both  current  and 
past  measurements  and  the  past  control  inputs.  An  important 

characteristic  ofthe  objective  function  ip(- ,  ■ )  is  that  for  the  Cauchy 
densities  the  conditional  performance  index  jzk  of  (16)  can  be 
determined  in  a  closed  form  and  the  optimal  control  signal  can  be 
determined  by  maximizing  this  expression. 

Up  until  now  we  have  considered  that  the  control  ut  is  adapted 

to  the  U -algebra  Ok  generated  by  the  measurement  history,  Zk. 
However,  if  the  state  is  decomposed  into  a  dynamic  system  that 
contains  all  the  underlying  random  variables  and  another  dynamic 
system  only  driven  by  the  control,  then  the  determination  of  the 
optimal  control  is  not  only  simplified,  but  indeed  tractable.  Given 
this  decomposition,  we  show  that  the  control  has  to  be  adaptive 
to  only  the  cr-algebra  generated  by  the  measurement  history  of 
the  decomposed  state  associated  with  the  underlying  random 
variables. 

Consider  the  linear,  discrete-time,  scalar  stochastic  system  of 
(1 )  with  the  measurement  history  given  by  (4).  Let  u;c  be  adaptive  to 
the  filtration  cr -algebra  Uk  generated  by  the  measurement  history 
Zk.  Filtration  implies  that  the  collection  of  O -algebras  <Jk  have  the 
property  that  if/  <  k,  then  O)  c  <jk  (Fleming  &  Rishel,  1975). 
Therefore,  filtration  is  the  evolution  ofthe  cr-algebra  generated 
by  measurement  history  through  time.  Adaptation  means  that 
the  control  is  a  measurable  function  of  events  on  this  O- algebra, 
i.e.,  this  ensures  that  the  control  sequence  is  causal.  Now  consider 
the  decomposition  xk  =  x  *  +  x  k  where 


x  k  —  ®x  k-i  +  Wjt-1,  z  k  —  Hx  k  +  vk, 
Xk  =  $Xk- 1  +  Ufc-1,  z  k  =  Hxt. 


(17a) 

(17b) 
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10 


Here,  x  k  and  z  k  are  the  state  and  the  measurement  of  the  subsystem 

containing  all  the  underlying  random  variables,  i.e.,  wk,  vk,  and  the 
initial  condition  xo,  which  is  Cauchy  distributed  with  zero  median 
similar  to  (3).  Similarly,  x  k  and  zk  are  the  state  and  measurement 

of  a  dynamic  system  driven  by  ut  with  initial  condition  xo.  The 
initialization  of  either  system  is  arbitrary  and  does  not  affect  the 
simplicity  introduced  by  the  decomposition,  as  long  as  the  sum  of 
the  initial  value  of  (1 7b)  and  the  median  of  x  o  equals  the  given  x _o. 

The  measurement  history  can  be  decomposed  as  Zk  =  Z  k  +  Z  k 
where 


11  Z  t  =  {z  o, . .  .,z  J,  Z  k  =  {za,  .  . .  ,  zj.  (18) 

12  In  the  following  it  is  shown  that  the  control  is  measurable  on  events 

13  generated  by  Z  k  only. 
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Theorem  1.  Consider  the  filtration  <J -algebra  <7  k  generated  by  Z  k, 
with  the  decomposition  Zk  =  Z  k  +  Zk.  For  Z  k  £  <7  k  and  (7  a— i  C 
O  k,Zk  is  adapted  to  <7  a-i  and  ua  is  adapted  to  O  k. 

Proof.  Start  with  k  =  0.  The  initial  state  is  decomposed  as 

xo  =  x  o  +  xo,  where  xo  is  a  given  non-random  parameter.  The 
measurement  decomposes  as  zo  =  z  o  +  zo,  where  zo  =  ETxo  is  a 
given  non-random  parameter  and  z  o  =  Z  o  £  (Jo-  Then,  uo,_which  is 
determined  by  z0,  is  adapted  to  <7 0.  At  k  =  1 ,  both  x  =  Ox  0  +  u0 
and  zi  =  Hx  i  are  adapted  to  <70,  and  thus  Z  i  is  adapted  to  <7o. 
For  the  measurement  at  k  =  1,  z  i  £  <7 1,  Z  i  £  Q  i,  and  <7 o  C  <7 
Hence,  since  ui  is  determined  by  Z^  =  Z  i  +  Z  i,  it  is  adapted  to 

<7 1 .  Recursively  to  any  k,  Z  k  is  adapted  to  <7  a-i  .  With_Z  k  £  (7  k,  and 
(7  i—i  C  <7 k,  uk  that  is  determined  by  Zk  =  Z  k  +  Z  k  is  adapted  to 

dr.  • 

Due  to  the  result  of  Theorem  1,  i.e.,  that  the  control  is  adapted 

to  <7  k,  the  conditioning  on  Zk  can  be  replaced  by  Z  k.  With  these 
substitutions,  the  optimization  or  maximization  step  of  the  model 
predictive  control  problem  is  rotated  as 

□  □  p-i  n 

/!.  =  max£  ^  ^+i/Ut 


_  max  E 

j>_i 

t:.  max/ 

»-1  z k 


□  □ 
o  K+u  ut 


p- 1 


m  o 


t  ■■■ii 


(19) 
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In  the  model  predictive  control  operation  mode,  although  the  op¬ 
timal  control  sequence  is  determined  over  the  prediction  inter¬ 
val  from  k  to  p,  only  the  current  control  input  ua  at  time  step  k  is 
applied  to  the  system.  Then,  at  subsequent  time  steps,  the  perfor¬ 
mance  index  in  (1 6)  is  maximized  again  to  compute  a  new  optimal 
control  sequence,  the  first  element  of  which  is  applied  to  the  sys¬ 
tem. 

The  conditional  pdf  needed  to  evaluate  (19)  can  be  determined 
using  the  results  in  Idan  and  Speyer  (2010).  Next,  we  briefly 
review  those  results  and  show  how  they  are  used  to  express  Jf  k 
analytically. 


46  5.1.  Propagation  of  the  conditional  pdf 


In  Idan  and  Speyer  (2010)  it  was  shown  that  the  conditional  pdf 
discussed  above  can  be  determined  analytically  in  a  closed  form. 

It  was  obtained  by  expressing  the  pdf  as  a  sum  of  terms,  each  of 
which  is  a  rational  function  of  the  random  variable.  The  number 
of  terms  in  this  sum  grows  with  time  and  the  parameters  of  these 
terms  are  updated  during  the  time  propagation  and  measurement 
updated  steps.  The  posteriori  form  of  the  conditional  pdf  given  the 
measurement  history  up  to  the  current  time  is  presented  in  (A.12) 


48 

49 


52 

53 


of  Appendix  A  and  presented  here  for  convenience  as 

D  _  D  Q2  atfk\k)x  i-  +  bfk\k) 

hk\at  xk\Zk\  =  □.  D2  o  '  (20) 

;=1  xt-A(k\k)  +CCh(k\k) 

where  the  conditional  pdf  is  explicitly  conditioned  on  the  a  - 
algebra  6  a  generated  by  Z  k,  the  stochastic  part  (17a)  of  the  de¬ 
composed  system  in  (17).  The  update  and  propagation  of  the 
conditional  density,  expressed  as  the  propagation  equations  for 
a,(k\k),  bi(k\k),  A,(k\k)  and  C0i(k\k),  as  well  as  their  boundary  con¬ 
ditions,  are  given  in  Appendix  A.  Note  thatthe  measurement  enters 

the  conditional  pdf  in  a  nonlinear  way  through  the  Ak+2(k\k)  = 
z  a /H  term  used  in  the  measurement  update  stage.  In  addition,  due 
to  the  state  decomposition  of  (17),  the  conditional  pdf  in  (20)  is 
independent  of  the  control  variable. 

5.2.  Construction  of  performance  index 

The  conditional  pdf  and  its  propagation  relations  reviewed  in 
Appendix  A  can  be  used  to  derive  an  analytical  expression  for 

the  conditional  performance  index  /  in  (19)  as  a  function  of  the 

p- 1  zk 

control  sequence  U,.  .  First,  (G)  is  used  in  (1 9)  to  yield 

□  □ 

]%  =  maxE  V  Xa+v  0  k 
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D  □ 


=  max  E 

p-4 

UA  i=k 


^~|1  m 

Mx(xi  qjM  (»)\^  a 


□  oo  □ 


=  max 

p-i 
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'Ll 


Mx(xi+i)Mu(ui) 


(21) 


X/xp.  - -x\  i  |a k  x  p/ •  •  •  /  x  k+v^- k\  dx  p, . . . ,  dx  a+i  . 

u  .  a  .  □ 

Mere,/x”p.  ■  .x"n-i|afc  xpr...,x  a+i|Za|  is  the  conditional  joint  density, 
determined  from  the  one-step  projected  conditional  pdf  of  (A.  17) 

and  the  state  transition  pdfs  that  are  influenced  by  the  process 
noise,  and  is  determined  by 

□.  .  -  □ 

kr--iM\Sk  ^/■■■/Xl+lIZfcl  g 

fx  plx  p-1  1  ‘  ‘X  k+ 1  X  —  1  /  •  *  •  /  X  fc+1  /  Z  ^ 

X  fxp-i  |xp-;j-j-  -x  x  p-1  \X  p-2/  •  *  •  /  *  HI/  Z  k\  1  1  1 
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■/xp|xp  1  X  pi*  p-1  I  /xp  l|xp  2 

y/  D.  □.  l7-  p 

X  /x  a+2  lx  fc+1  Ik+2\xk+t\  /x  A  1  |C  I  k+t  \1  A  F  ' 


□. 


(22) 


while  from  (2)  and  using  the  Markov  property  of  (17a),  the 
transition  probability  is 

k  lx-  *;+ik)  =  - Sir -  (23) 

1  (x^-Of^+p2 

for  j  =  k,  .  .  .  ,  p  —  1 .  Substituting  (22)  into  (21)  we  obtain 

□H  oo  OIL  □  Q 

Af/xjjM/up-  J/x-  p\ i  | 


1%  =  max 

^  k  p-1 

Up  GF  -oo 


D 


tp|Xp-1  Ap|Ap- 

0 


□  _  _  0  _ 

X  dx  p  ■  Mx(xk+])Mu(uk)fxM\dk  *H-i|Zfc|  dx  k+-\  .  (24) 

Due  to  the  rational  polynomial  structure  of  both  the  associated 
pdf’s  presented  earlier  and  of  the  objective  function  ,  •)  in  (6), 
(he  above  integrals  can  be  carried  out  analytically.  However,  for 
presentation  simplicity,  we  will  derive  a  closed  form  result  when 
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the  state  only  at  the  prediction  horizon  p  =  k+m  is  weighted  in  the 
objective  function  of  (6).  This  is  obtained  by  letting /7,+i  —  oo  ,j  = 
k,  ...  ,p  —  2,  which  yields  Ma(x,+i  )  =  1, ;  =  k,  .  .  .  ,  p  —  2.  In  this 
case  the  conditional  performance  index  to  maximize  becomes 

/*  U  D”  .  .  0.0 

M„( \ij)  Mx(xp)fx p\g t  x  pjz  j 

=k  “ 


=  max 

t-  k  p  1 

Ul 


(25) 


□. 


Theconditiorjpl  pdf/ip|crt  x  P\Z  k\  is  obtained  by  time-propagating 
x  t\Z  t\  of  (20)  m  times  using  (A.17)-(A.21)  to  yield 


□. 


D  P2 _ ai(p\k)x  ,,  +  bfplk ) 


fxpl&k  Xv\^-  tl  □.  ,  ,  ,  ,D 

1  =  1  X 


(26) 


f~Mv\ *r  ~ + °^(v\k) 

The  parameters  a, (p \ k ),  bi(p\k),  Aj(p\k)  and  COi(p\k)  are  expressed 
explicitly  using  (A.18)-(A.21).  The  integral  in  (25)  is  evaluated 
analytically  as 


□ 


□. 


□ 


g(xp)  =  MjfxpJ/x-piffj  xr\Zk\  dict 

_ _ OO 

□  00  n2  k+2  a  fy\k)x  +  b  (v\ k) 


nt 


dx  1 


sequence  that  yields  this  particular  x  *  is  not  unique  (clearly  it  is 
unique  when  m  =  1.)  One  such  sequence,  which  assumes  that 
Uj,  j  =  k, ...  ,p 

x*  -0 


1  is  constant,  is  given  by 


Xk 


-1 


j  =  k,...,p-  1. 


(29) 


□ 


0i 


Nextvye  use  homotopy  to  incorporate  contwl  penalties  back  into 
tne  performance  index  For  fh at,  trie  weights  4 ),  f  =  k, .  .  .  ,p  —  1 

are  set  to  very  high  values,  thus  imposing  a  small  change  in  the 
value  of  the  performance  index.  Starting  with  an  initial  guess  for 
uj,  j  =  k,  ...  ,p  —  1  as  in  (29),  the  maximization  of  Jz  is  carried 
out  numerically  using  the  accelerated  gradient  method  (Fletcher 
&  Powell,  1 963;  Myers,  1 968)  as  a  refinement  step.  The  process  is 
repeated  for  decreasing  <^s  until  reaching  their  design  values. 

The  gradient  needed  for  the  accelerated  gradient  method  in 

the  refinement  step  in  the  homotopy  optimization  method  can 
analytically  by  applying  the  chain  rule  to  (28).  The 


be  computed  ar 


partial  of  the  performance  index /7~ 

0  Jr  1  k 


with  respect  tout  is  given  by 


dJz 


v- 1 
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+  x  f  +  q2 


-A  (p\kp*  +  02(p\k) 


V  i=  1  p 


= 


=  nr), 


,[]  Ui(p\k)xv  +  bi(p\k) 


.....  r  ...  -  dx 
(xp  +  xp)2  +  q2  _  Q  2  p 

rj  p  xv-Ai(p\k)  +0);(p\ k) 

^  1  a  fx  +  A)n  +  (b  -  ax  )(n  +  (O  )  . 

ip  ip  i  i  P  P  i 

(x„  +  A,)2  +  (nP  +  Od,)2 


(27) 


where  in  the  last  expression  the  functional  dependence  of  the 
parameters  a,,  b„  Aj  and  CO,  on  the  time  index  (p\k)  is  removed  for 
brevity.  Using  this  resultin  (25)  yields  the  following  maximization 
problem 

d  &  D  r*) 

„2  1  2  g(x  )■ 


HFf 

U, 


(28) 


j=k  j 


+  C- 


p-i 


The  dependence  of  /z  on  the  elements  of  Ufc  is  given  explicitly 
in  the  first  term  of  (28)  and  implicitly  in  the  second  term  via  x  v, 
which  is  obtained  by  propagating  (17b)  m  times  using  the  control 
sequence  Up~: ;  hence,  x  is  a  function  of  an  open  loop  control 
policy  from  time  k  to  time  p  —  1,  and  xp  is  the  maximizing  value. 
Note  that,  as  with  the  apriori  conditional  pdf  in  (20),  the  number 
of  terms  that  determine  the  performance  index  in  (28)  grows 
with  k. 

The  conditional  performance  index  in  (28)  is  an  extremely 
complex  function  of  the  m  elements  in  Up_1.  Consequently,  there 
is  no  analytical  solution  to  the  associated'  maximization  problem. 
Hence,  a  numerical  procedure  is  proposed  next  to  determine  the 
optimal  control  signal  at  each  time  step  k.  Since  the  numerics 
use  realizations  where  random  variables  take  on  specific  values, 
regular  fonts  are  used  hereon  for  the  state  and  control. 

5.3.  Homotopy  optimization  for  maximizing  Jf 

To  solve  the  maximization  problem  in  (28)  using  the  homotopy 
method,  we  first  address  the  problem  when  the  control  variable 
is  removed  from  the  performance  index,  attained  by  letting  — 
oo,  j  =  k,  ...  ,p  —  1.  In  this  case,  /z-  becomes  a  sum  of  rational 


£ 


ui 


/M 

□ 

X  01’ 


+  Ci 


-i  dgfxp) 


□  22m‘  0 zCxp) 

Ui  +  Q 


(30) 


for£  =  k,  . .  .  ,p  —  1,  where  g(xp)  is  given  in  (27)  and  its  gradient 


dg(xp)  =  np ^ 

d  x  p 


□  □ 
a2  %  t 


D.2  □  □  ^ 
-  A,  +  /]„  +  «, 

: q 


+  aj2  +^n?  +  0 


i=l 

a  . 

2  xp  +  Ai  HiAii)p  +  bi  rip  +  co, 

□  2  : 

CO,  (xp+Ai)  +  (rjr  +  co,) 


Q 


(31) 


Although  an  analytic  form  of  the  second  derivative  of  g(xv)  1 
determined,  it  involves  a  sum  of  terms  that  increases  the  co 


64 

65 


can  be  68 

i  COmpU-  69 

tation  time.  Thereby,  we  use  an  accelerated  gradient  method  that  70 

iteratively  estimates  the  Hessian  matrix  of  the  performance  in-  71 

dex  from  an  initial  guess,  and  by  performing  a  sequence  of  one-  72 

dimensional  searches.  Each  such  search  maximizes  the  function  73 
along  a  specific  search  direction,  determined  by  the  gradient  and  74 

the  current  estimate  of  the  Hessian.  The  search  result  is  used  to  75 

update  the  estimate  of  the  Hessian,  which  is  then  used  to  deter-  76 

mine  a  new  search  direction  orthogonal  to  the  previous  one.  For  a  77 

quadratic  performance  index  this  second  order  method  converges  78 

to  the  optimum  in  n  steps,  where  n  is  the  number  of  variables  79 

being  optimized  over  (Fletcher  &  Powell,  1963;  Myers,  1968).  Al-  so 

though  the  Cauchy  performance  index  is  not  quadratic,  it  isapprox-  si 

imately  quadratic  locally  around  the  maximum.  Therefore,  once  a  82 

point  sufficiently  near  the  maximum  is  found,  the  accelerated  gra-  83 

dient  method  will  find  this  maximum  easily.  Tofind  this  start  point  84 

forthe  accelerated  gradient  method,  the  “homotopy  optimization 

method  uses  a  grid  search  to  find  the  optimal  x  *,and  then  finds  a  86 

predictive  control  sequence  to  get  from  x  *  to  x  *p  From  that  point,  we  87 

do  a  simple  gradient  search  to  get  to  the  quadratic  neighborhood  88 

before  using  the  accelerated  gradient  method  to  find  tne  optimum. 

45  deterministic  state  propagation  in  (17b),  for  m  >  1  the  control 


be  used  to  determine  the  global  optimum,  x*.  Since  there  are 
no  control  penalties,  this  global  optimum  is  focused  on  keeping 
the  predicted  state  small  for  regulation.  Moreover,  due  to  the 


89 


To  obtain  insight  into  the  properties  of  the  Cauchy  stochas¬ 
tic  controller,  the  one-step,  one-measurement  example  is  first 


6.  Numerical  examples 


90 
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Zo 

(a)  a  =  0.1,  /3  =  0.02,  Y  =  0.5,  q  1  =  0.7,  Co  =  °°- 


zo 

(b)  a  =  0.5,  /?  =  0.02,  Y  =  0.1,  f/i  =  0.7,  Co  =  oo. 


Fig.  1.  Contour  plots  of  the  performance  index  for  the  Cauchy  one-step  controller. 


analyzed  in  Section  6.1.  Then,  multi-step  numerical  simulation 
results  are  given  in  Section  6.2,  which  illustrate  the  Cauchy 
controllers  behavior  in  the  presence  of  dominant  Cauchy  measure¬ 
ment  noise ,and  then  .dominant,, Cauchy  process  noise.  The  nerT 
formance  of  the  Cauchy  controller  is  compared  with  a  somewhat 

standard  scheme,  here  the  model  predictive  linear-exponential- 

Gaussian  (LEG)  controller.  Although  dynamic  programming  pro¬ 
vides  a  closed-form  solution  to  the  LEG  problem  (Speyer  &  Chung, 


The  optimal  controller  can  be  obtained  by  maximizing  (32)  with 
respect  to  uq.  The  necessary  optimality  condition,  dj*  /du q  _  0, 

reduces  to  finding  the  roots  of  the  fifth-order  polynomial 

l5u5  4  3  2 

o  +  UuQ  +  Z3W0  +  I2UQ  +  Z1W0  +  Zo  —  0,  (33) 

where 

/  =  1,  l  =  3.5z~  ,  1  =('5.2315z"2 


derivation  of  the  LEG  m-step  controller,  used  here  for  comparison, 

two  classes  of  jodf’s,  and  thus  the  objective  functions,  is  given  in 
Appendix  B.1  and  used  in  the  subseguent  examples. 

6.1.  One-step,  one-measurement  example 

To  explore  the  characteristics  of  the  proposed  Cauchy  con¬ 
troller,  in  this  section  we  evaluate  numerically  its  simplest  form. 
Specifically,  we  examine  the  value  of  the  optimal  control  signal  at 
k  =  0,  i.e.,  u£(z  oj,  as  a  function  of  the  first  measurement  z~o,  that 
varies  due  to  the  measurement  noise  V0  (Idan  et  al.,  2010),  while 
considering  a  one  step  horizon,  i.e.,  m  =  1 .  The  parameters  for  the 
system  and  Cauchy  signals  are  first  chosen  as  O  =  1,  H  =  1,  Q  = 
0. 1,  /?  =  0.02,  y  =  0.5,  and  xo  =  0.  This  example  represents  the 
case  where  the  uncertainty  in  the  initial  condition  is  smaller  than 
the  measurement  noise,  i.e.,  O  <  y  .  To  explore  the  effect  of  higher 
uncertainty  in  the  initial  conditions,  also  the  case  for  which  a  >  y 

is  considered  by  choosing  the  values  O  =  0.5  and  y  =0.1. 

Initially,  no  penalty  is  introduced  on  the  control  signal  in  (28), 
i.e.,  <f0  —  00 ,  while  the  state  at  k  =  1  is  weighted  with  /7-1  = 
0.7.  Substituting  the  system  parameters  into  (28),  the  performance 

index  becomes 

0.1148('4.1667z~2  -  1.0163i(0z  0  +  1) 

(£o2  +  O.iejfi/2  +  0.6724; 

0.03416f7.582(g2o+  3.4153hqz"o^-  l)  {32) 

(702  +  O.16j  (uo  +  z  o)2  +  1-22 


5  4  0  3 

l2  =  ( 3.6806z~2  0 


0  +  3.6681,) 


0  +  6.6305jz~  ,  (34) 

U  =  C0.9491z~i4-  3.2124z~o  +  2.9623,), 

/o  =  (0.07782z  2  0 

0  +  0.3992jz~  . 

This  fifth  order  polynomial  always  has  at  least  one  real  root.  If  three 
roots  are  real,  then  there  are  two  local  maximum  values  and  the 
larger  of  the  two  gives  the  optimal  control.  A  similar  expression, 
with  different  numerical  values,  can  be  attained  also  forthe  O  >  y 
case. 

Contour  plots  of  the  performance  index  for  the  two  case 
discussed  above  are  shown  in  Fig.  1,  with  expanded  views  given 
in  Fig.  2.  In  both  cases  it  is  observed  that  the  performance  index 
has  two  ridges,  with  one,  marked  by  the  dashed  line,  being 
dominant.  This  indicates  that  the  solution  of  (33)  had  two  maxima, 
with  the  higher  one  being  the  global  maximum.  These  contours 
demonstrate  a  clear  difference  in  the  optimal  control  action  forthe 
two  cases  examined.  For  the  first  case  of  O  <  /.shown  in  Figs.  1(a) 
and  2(a),  the  optimal  control  action  u*  i^  a  nonlinear  function  of  z  0 


and  is  almost  aligned  with  the  uo  ~  0  line.  In  the  center  is  an  ellipse 
shaped  contour  whose  major  axis  is  oriented  between  the  two 
ridges.  This  demonstrates  that  when  the  measurement  uncertainty 


is  dominant,  the  optimal  control  action  is  minimal.  The  optimal 
control  strategy  changes  drastically  for  the  a  >  y  case  depicted 
in  Figs.  1  (b)  and  2(b).  In  this  case,  the  global  optimum  generates 

an  approximately  linear  relation  between  u*  and  z~o,  indicating 
that  a  significant  control  action  is  adopted  when  the  measurement 

uncertainty  is  decreased. 

This  difference  in  behavior  cannot  be  deduced  by  examining 
only  the  conditional  variance  and  conditional  mean  of  the 
estimation  error  at  k  =  0,  which  are  given  in  Idan  and  Speyer 
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Fig.  2.  Zoomed  in  contour  plots  of  the  performance  index  for  the  Cauchy  one-step  controller. 


Fig.  3.  Cauchy  (solid  line)  and  Gaussian  (dashed  line)  one-step  controller  with  parameters  variations  in  a  and  y . 


Fig.  4.  Cauchy  (solid  line)  and  Gaussian  (dashed  line)  one-step  controller  with  parameters  variations  in  fS  and  /71. 


□ 


□ 


(2008)  as 

□  .  „,2rD  U  z"o 

E  fxo  —  x  oy  |z  0  =  ay  "  +1  ,  (35) 

(0  +77 

where  x  o  =  ay z  o/(o  +  y ).  This  conditional  variance  represents 
the  uncertainty  in  the  state  estimation  and  grows  with  z  20  Eq. 
(35)  shows  that  the  values  of  the  conditional  variance  and  the 
conditional  mean  do  not  change  when  interchanging  the  values 
of  y  and  O.  Hence,  the  change  in  the  control  strategy  is  correctly 
deduced  by  the  optimal  Cauchy  control  scheme.  Specifically,  it 
correctly  reduces  the  control  effort  when  a  <  y,  i.e.,  when  a 
measurement  noise  impulse  is  more  likely  than  a  large  impulse  in 
the  initial  condition.  Alternatively,  a  nearly  linear  control  action  is 
produced  when  O  >  y ,  i.e.,  the  controller  effectively  forces  the 
state  back  toward  the  origin  when  the  measurement  uncertainty 
is  relatively  small. 

To  obtain  further  insight,  the  performance  of  the  Cauchy  con¬ 
troller  is  compared  to  the  LEG  controller,  presented  in  Appendix  B. 
First,  in  Fig.  3(a),  the  optimal  Cauchy  and  LEG  control  signals  are 
plotted  versus  the  measurement  z  o  for  the  a  <  y  case  with  dif¬ 
ferent  values  of  y .  In  this  case,  for  the  LEG  controller,  the  optimal 
control  is  given  by  u*  =  —x'o  and  therefore,  it  is  linear  in  z  o.  Fig.  3(a) 
shows  that  the  Cauchy  controller  is  symmetric  about  z  o  =  0. 
Furthermore,  the  Cauchy  controller  approaches  zero  when  |z~o|  be¬ 
comes  large,  while  the  Gaussian  controller  remains  linear  with 
respect  to  the  measurement  z  o.  This  is  a  significant  difference  in  be¬ 
havior  between  the  Cauchy  and  Gaussian  optimal  controllers  that 
can  be  deduced  analytically  from  (33).  If  u*(z  o)  is  finite,  the  dom¬ 
inant  term  in  (33)  as  |zo I  —  °°  is  Uu*(oo),  or  liB|z-0|-.oo  u*(zo) 
—  0. 


Fig.  3(b)  examines  the  case  where  y  <  a  for  different  values 
of  a.  In  this  case  also  the  Cauchy  controller  is  nearly  linear. 
Interestingly,  the  slope  of  the  Cauchy  controller,  which  is  hardly 
effected  by  the  change  O,  is  nearly  identical  to  that  of  the  LEG 
controller. 

The  effect  of  parametric  changes  in  /?  and  r/i  for  the  y  >  a  case 
is  explored  in  Fig.  4.  In  this  case,  the  LEG  controller  is  a  single  line, 
since  for£i)  =  oo,  i.e. .with  no  control  weighting,  the  LEG  controller 
is  independent  of  the  parameters  /?  and  rj i  (see  Appendix  B). 
Exploring  this  parametric  change  in  the  y  <  a  reveals  plots  similar 
to  Fig.  3(b),  not  shown  here  for  brevity. 

Now  we  consider  the  cases  with  control  weighting.  Reducing 
<o  from  oo  to  5  has  a  negligible  effect  on  both  the  Cauchy  and  the 
LEG  controllers.  Further  reduction  in  £o,  i.e.,  increase  in  the  control 
weighting,  affects  the  control  strategies  as  depicted  in  Fig.  5.  The 
nonlinear  behavior  of  the  Cauchy  controller  is  retained  for  the 
y  >  a  case  depicted  in  Fig.  5(a),  while  an  approximately  linear 
Cauchy  controller  is  shown  in  Fig.  5(b)  for  the  y  <  a. 

For  y  >  a,  in  the  region  where  z  o  is  relatively  small,  the 
Cauchy  controller  is  approximately  linear  (see  Fig.  5(a)).  However, 
the  Cauchy  controller  in  Fig.  5(a)  for  large  z  o  goes  toward  zero. 
This  is  in  sharp  contrast  with  the  LEG  controller,  which  remains 
linear  in  the  measurement.  Therefore,  the  problem  of  handling 
outliers,  which  occur  for  the  Cauchy  pdf,  appears  to  be  resolved 
by  the  Cauchy  controller  explicitly,  and  not  in  the  filtering  or 
estimation  stage  as  has  been  done  traditionally.  Note  that  the 
controller  design  process  explicitly  uses  the  parameters  y  and 
a  and  hence  their  relative  size  y  >  a,  i.e.,  it  anticipates  more 
impulsive  measurement  uncertainty  than  process  uncertainty.  If 
y  <  a,  the  Cauchy  controller  behaves  approximately  like  the  LEG 
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20 

(a)  a  =  0.1,  P  =  0.02,  Y  =  0.5,  r/1  =  0.7. 


-30  -20  -10  0  10  20  30 

So 

(b)  a  =  0.5,  p  =  0.02,  Y  =  0.1,  q  1  =  0.7. 


Fig.  5.  Cauchy  (solid  line)  and  Gaussian  (dashed  line)  one-step  controller  with  parameters  variations  in  <^o  for  Y  >  O  left  and  y  <  a  right. 


linear  controller  in  Fig.  5(b),  i.e.,  it  expects  more  impulsive  process 
uncertainty  than  measurement  uncertainty.  For  both  the  Cauchy 
and  Gaussian  changes  in  £0  make  only  small  changes  in  the  gain. 

6.2.  Multi-step  numerical  examples 

The  dynamic  characteristics  of  the  Cauchy  optimal  controller, 
obtained  by  maximizing  the  performance  index  in  (28),  are 
explored  through  several  multi-step  numerical  examples.  The 
Cauchy  optimal  control  results  are  compared  against  the  least- 
squares  equivalent  LEG  controller,  obtained  from  (B.12)  and  the 
Kalman  filter  from  (B.10).  The  two  examples  that  are  discussed  in 
this  section  are  a  stable  system  with  a  horizon  length  of  m  =  2  and 
an  unstable  system  with  a  horizon  length  of  m  =  5.  The  param¬ 
eter  values  of  f3  and  y  are  interchanged  to  see  how  the  controller 
performance  changes  when  it  is  designed  for  a  large  measurement 
noise  impulse  in  contrast  to  when  it  is  designed  for  a  large  process 
noise  impulse.  All  simulations  in  this  section  use  either  noise  pa¬ 
rameters/3  =  0. 1,  y  =  0.02  or /3  =  0.02,  y  =  0. 1  and  initial 
condition:  0  =  0.5  and  x  o  =  0.  They  also  all  use  the  same  system 
parameter  of  H  =  1 ,  and  either  0  =  0.95  or  <Z>  =  1.05.  Substi¬ 
tuting  these  parameters  into  (28),  the  performance  index  is  max¬ 
imized  numerically  with  respect  to  the  control  at  each  time  using 
thehomotopy  optimization  method. 

The  performance  of  the  Cauchy  and  LEG  model  predictive 
controllers  are  compared  for  m  =  2  and  using  r]p  =  0.7  as  the 
parameter  for  the  penalty  on  the  state  at  Xk+2  and  ^  =  8,  i  = 
k,k+  as  the  parameter  for  the  control  penalty.  First,  for  y  =  0.1 
and  /3  =  0.02,  when  the  noises  are  small,  the  Cauchy  and  the 
LEG  controllers  exhibit  similar  performance.  However,  they  behave 
rather  differently  when  a  large  measurement  pulse  occurs.  A 
significant  measurement  noise  pulse  causes  a  large  measurement. 
A  measurement  noise  pulse  does  not  represent  a  state  deviation 
and  thus,  for  proper  regulation,  the  controller  should  ignore  that 
measurement.  The  Cauchy  predictive  controller,  designed  for  y  > 
f3,  is  able  to  make  this  distinction,  whereas  the  LEG  predictive 
controller  reacts  linearly  to  all  the  pulses  and  does  not  differentiate 
as  shown  in  Fig.  6(a),  at  around  time  step  k  =  51.  The  Cauchy 
controller  ignores  this  large  measurement  deviation,  applying 
almost  zero  control,  whereas  the  LEG  controller  applies  a  very  large 
control  input  that  causes  the  state  to  deviate  away  from  zero.  An 
additional  control  effort  is  then  required  to  correct  this  deviation. 
In  this  way  the  Cauchy  controller  manages  to  avoid  unnecessary 
actuation  and  thus  maintains  the  system  performance. 

Performance  differences  are  also  observed  when  encountering 
a  large  process  noise  signal.  In  Fig.  6(a),  at  time  steps  k  =  1  and 
k  =  13  process  noise  pulses  occur,  and  although  both  controllers 
react  to  them  and  are  able  to  overcome  this  deviation,  the  Cauchy 
controller  does  so  much  quicker  than  the  LEG  by  applying  a  much 
larger  control  effort.  The  Cauchy  controller  applies  a  larger  control 
because  its  gain  for  small  measurement  values  are  higher  than  that 
of  the  LEG. 

When  the  Cauchy  predictive  controller  is  designed  for  (3  > 

y ,  the  behavior  of  both  the  Cauchy  and  LEGjpontrollers  perform 

A 


similarly  as  shown  in  Fig.  6(b),  demonstrating  the  same  linear 
behavior  seen  in  Figs.  3(b)  and  5(b).  One  effect  of  having  (3  >  y 
is  that  the  state  trajectories  for  the  Cauchy  and  LEG  controllers 
appear  equal  in  Figs.  6(b)  and  7(b).  Even  though  the  controllers 
process  the  measurements  differently,  due  to  the  dominance  of  the 
process  noise  the  controls  and  state  appear  similar. 

Using  the  same  noise  sequence,  similar  results  are  obtained  for 
an  unstable  system  with  m  =  5,  while  using  r]p  =  0.7  as  the 
parameter  for  the  penalty  on  the  state  at  Xk+5  and  =  8 ,  i  = 
k,  .  .  .  ,k  +  4  as  the  parameter  for  the  control  penalty.  These  re¬ 
sults  are  depicted  in  Fig.  7.  For  the  y  >  (3  shown  in  Fig.  7(a), 
the  measurement  noise  impulses  are  more  probable  than  process 
noise  impulses.  Hence,  when  a  large  measurement  impulse  occurs 
at  k  =  51 ,  the  Cauchy  controller  ignores  it,  applying  almost  zero 
control,  whereas  the  LEG  controller  applies  a  very  large  control  in¬ 
put  thaf  causes  the  state  to  deviate  away  from  zero.  For  the  y  <  (3 
case  shown  in  Fig.  7(b),  both  the  Cauchy  and  the  LEG  controllers 
regulate  in  a  similar  fashion. 

Finally,  the  Cauchy  controller  is  examined  in  simulation  with 

Gaussian  noises,  for  which  the  LEG  design  is  optimal.  The  design 
parameters  used  for  the  Cauchy  controller  were  obtained  by  using 

the  least-square  fit  relations  between  the  Cauchy  and  Gaussian 
pdf’s  presented  in  Appendix  B.  Both  a  stable  system  with  m  —  2 

and  an  unstable  system  with  m  =  5  presented  earlier  were  used 
in  these  simulations.  The  results  are  presented  in  Fig.  8,  showing 
also  the  numerical  values  used  in  the  Cauchy  controller  design. 
The  response  of  the  Cauchy  controller  is  very  similar  to  that  of  the 
optimal  (in  this  case)  LEG  controller  for  both  cases.  This  clearly 
demonstrates  the  robustness  of  the  Cauchy  controller,  which 
performs  nearly  optimally  even  in  the  Gaussian  noise  simulation 
for  which  it  was  not  designed. 

7.  Conclusions 

A  new  control  design  paradigm  is  proposed  for  optimal  con¬ 
trol  problems  for  scalar  linear  dynamic  system  driven  by  Cauchy 
distributed  signals.  The  scalar  dynamic  programming  solution  is 
intractable.  Since  the  model  predictive  controller  is  known  to 
have  similar,  though  suboptimal,  performance,  an  m-step  optimal 
model  predictive  controller  was  derived.  The  conditional  expecta¬ 
tion  of  the  new  objective  function  with  respect  to  the  exact  Cauchy 
conditional  pdf  was  determined  in  closed  form.  The  Cauchy  con¬ 
troller  results  were  compared  against  an  equivalent  LEG  m-step 
optimal  modelApredictive  controller.  We  showed  that  both  con¬ 
trollers  behave  in  a  similar  way  when  faced  with  non-impulsive 
Gaussian  noises  that  generate  small  measurement  noise.  However, 
a  dramatic  difference  was  observed  between  the  two  controllers 
when  faced  with  Cauchy  measurement  noise  sequences  with  large 
pulses.  The  Cauchy  optimal  controller  ignores  measurement  noise 
pulses  and  reacts  strongly  to  process  noise  pulses,  effectively  dif¬ 
ferentiating  between  process  and  measurement  noise  and  thereby 
producing  a  scheme  for  handling  measurement  outliers,  jjfhis 
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£  =  8,  m  =  2. 


Fig.  6.  2-step  Cauchy  and  Gaussian  controllers  with  (3  and  y  parameters  interchanged  for  stable  system  in  Cauchy  simulation. 
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Fig.  7.  5-step  Cauchy  and  Gaussian  controllers  with  (3  and  /  parameters  interchanged  for  unstable  system  in  Cauchy  simulation 
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Fig.  8.  Cauchy  and  Gaussian  controllers  for  stable  and  unstable  systems  in  a  Gaussian  simulation. 


ability  of  the  Cauchy  controller  to  reject  outliers  depends  cru¬ 
cially  on  the  dominance  of  the  measurement  noise  overthe  process 
noise.  In  contrast,  the  LEG  controller  is  linearly  proportional  to  the 
measurements  and  always  reacts  to  large  measurement  outliers. 

The  conditional  performance  index  is  a  sum  that  grows  at  each 
measurement  update.  However,  as  justified  in  Idan  and  Speyer 

(2010),  the  number  of  terms  in  the  conditional  pdf  can  be  truncated 
with  minimal  effect  on  the  performance;  in  fact,  the  truncation  of 
old  terms  enhances  numerical  performance  for  unstable  systems. 
This  result  is  important  for  the  feasibility  of  the  Cauchy  model 
predictive  controller  because  it  leads  to  a  computationally  efficient 
implementation.  Finally,  generalization  of  this  scheme  is  being 
resolved  byfirstextendingtheestimatortothemultivariablecase, 
where  the  characteristic  function  of  the  conditional  pdf  must  be 
propagated  (Idan  &  Speyer,  2013).  Then,  Parseval’s  Theorem  is  used 
to  express  the  conditional  performance  index  as  an  integral  over 
the  spectral  variables  involving  the  characteristic  function  of  the 
conditional  pdf  (Fernandez,  Speyer,  &  Idan,  2013).  This  formulation 
also  allows  a  pathway  for  the  generalization  of  the  scalar  stochastic 
controller  developed  here  to  the  multivariable  case. 

Appendix  A.  Algorithm  for  the  propagation  of  the  conditional 
pdf 

This  appendix  briefly  summarizes  the  Cauchy  estimator  derived 
previously  in  Idan  and  Speyer  (2010).  This  estimator  is  central  in 
deriving  the  Cauchy  controller  discussed  in  this  study. 

The  conditional  pdf  of  x  k  given  past  data  Z  (i.e.,  before  the  z  t 
measurement  is  processed)  is  shown  in  Idan  and  Speyer  (2010)  to 
be  expressed  in  a  factored  form  given  by 

iz  |D 

/xfcloVl  k  k- 1  = 


O1 


a/klk-  ^)il  +  bi(k\k-^) 

U  . 

;=  i  xk-At(k\k-l)  +COi(k\k-  \) 


At  the  initial  k  =  0  there  is  only  one  term  in  the  above  sum,  with 
the  initial  parameters  aif0|  —  1)  =  0,  b-\(0\  —  1)  =  d/n ,  Ak(0\  — 

1)  =  0,  and  Cdif0|  —  1)  =  a.  After  an  additional  measurement  z  * 
is  obtained,  a  measurement  updated  conditional  pdf  is  determined 
by 

n  n  □  □ 

fx  o  xt\ik=  *k'zt'z  Tr  .  (a.2) 

TIT  I  7~  073  | 

Jzk\Ok- 1  2k\L  k- 1  I 

The  density  function  in  the  above  numerator  is  computed  as 

□_  _  _  o_  _  o 

fx  k,i k\&k-‘\  *fc/Zfr|Zfc-l|  —  fx  Xfc|Zfc-l|  fvk  Z  k  ~  Hx  k  \ 

=  0  at(k\k-^xk  +bi(k\k-  1) 

tt — ,  ,.Q 


i=i  Tc -A(k\k-1)^  +a>2i(k\k-1) 
Y/n 

xn 


_  — :■ 

2  t-Hxt  +  Y2 


(A.  3) 


i  k  ~  t 

A  simple  assumption  made  in  Idan  and  Speyer  (2010)  guarantees 
that  the  complex  conjugate  roots  in  (A. 3)  of  the  denominator 
polynomial  in  x  k  are  distinct.  Therefore,  using  partial  fraction 
expansions,  (A. 3)  can  be  written  as 

□.  .  ,  -  n  n _ g.(k\k)x  +  b  (k\k) _ 

/  i  ,z  \Z  U  _  i  ^  i  ■  (A. 4) 

*k,*k\&k-1  k  k  *_1 


i  k  i  • 

RrVA (k\kP  2  +C0f(k\k) 


(A.  1 ) 


This  step  introduces  an  additional  term  in  the  pdf  sum  with  four 
parameters.  Ai(k\k)  =  Aj(k\k  —  ij,  0)j(k\k)  =  COi(k\k  —  1  ),i  = 

1,  .  .  .  ,  k+1  represent  the  unchanged  roots  of  the  pdf  denominator, 
while  Ak+2(k\k)  =  z  JH  and  C0k+2(k\k)  =_  y/|  f-f  |  represent  its 
new  complex  measurement  dependent  roots,  a  j(k\k)  and  b  i(k\k) 
^ije  corrjguted  frorr^ie  partial  [fraction  expansions  as 

m(k\k)  a/k\k—  ij 

=  Fi(k)  (A.5) 

bi(k\k)  bi(k\k--\) 
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F,(k)  = 
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(A. 6) 
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(A.7a) 
(A- 7b) 

(A.7c) 


'■'i+2(k\ k) 

Note  that  the  matrices  F,(k),  i  =  1,  .  .  .  ,  k  +  1  are  not  functions  of 

the  numerator  parameters  cij(k\k  —  1 )  and  bi(k\k  —  1 ),  generating 
a  linear,  time-dependent,  stochastic  update  equation  (A.5).  The 
numerator  parameters  of  the  new  term  in  the  pdf  sum  are 


a 


12  ak+2(k\k)=  ai/k+2(k\k) 

i=i 

*□- 

13  b  k+2  (k\k)  =  b  irk+2(k\k), 
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i4  where  for  i  =  1,  .  .  . ,  k  +  1_. 
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(A. 9) 

(A.  10a) 
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is  and 

□  □ 

1?  Firk+2(k)=  2(Ai(k\k)  -  Ak+2(k\k))  1  '  (A.IOb) 

is  Next/Z'  .|(7.  zMZm|  is  determined  by  integrating  the  pdf  in 

19  (A.4)  with  respect  to  x  k.  Due  to  the  modular  structure  of  this  pdf, 

20  the  integral  is  evaluated  analytically  as 


|z  |  = 

/z/cloVi  k  k- 1  0 


i+2  a  (k\k)x  +  b  i(k\k) 
□  ,  CO(k.k) 


(A.  11) 


Finally,  the  conditional  pdf  in  (A. 2)  is  obtained  by  dividing  the 
result  in  (A.4)  by  the  one  in  (A.1 1),  yielding 


fit\6t  Xl\Zk\  - 


tn(k\k)x  kJ-b,(k\k) 


Q  '  ■  (A.  12) 
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25  The  terms  ai(k\k)  and  bt(k\k)  are  obtained  by  dividing  a  i(k\k)  and 
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bt(k\k) 

by  the  result  in  (A.' 
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(A. 12) 

indicates  that  the 

pdf  structure 

defined  in  (A.1) 

is  main- 
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tained  in  factored  form  after  a  measurement  update  with  one  ad¬ 
ditional  term  in  the  pdf  sum.  - 

It  should  be  pointed  out  that  fik\ak  xk\Zk\  of  (A. 12)  could 
be  used  to  compute  the  state  estimate  and  its  estimation  error 
covariance.  This  result  is  not  shown  here,  because  it  is  not  used 
in  deriving  the  Cauchy  controller.  The  interested  reader  is  referred 
to  Idan  and  Speyer  (2010)  for  details. 


Next  we  consider  the  time  propagation  from  step  k  to  k  + 
1.  Given  the  conditional  pdf  in  (A.  12),  our  goal  is  to  construct 

fik+t  \&t  !  i+i  |Z  k[  ,  given  by  the  Chapman-Kolmogorov  equation 
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(A.  16) 


Using  (A.  12)  and  (A.  16),  the  integral  in  (A.  15)  can  be  computed  for 
each  term  in  the  sum  analytically,  leading  to 

□.  .  u 

fik+l\6k  1+  1  I Z  | 


a>(k+  1 1 k)x  FFi  +  bj(k  +  \  \k) 
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where  for  i  =  .  ,k  +  2 

A  (k  +  1| k)  =  0A  (k\k) 


(A.  17) 


(A.  18) 


This  shows  that  the  factored  pdf  structure  of  (A.1)  is  regained  in 
(A.  1 7)  after  a  measurement  update  and  a  time  propagation  step. 

Appendix  B.  Gaussian  model  predictive  controller 

To  compare  the  performance  of  the  Cauchy  controller  with  one 
assuming  Gaussian  noises,  one  has  to  consider:  (a)  the  parameters 

of  the  Gaussian  pdfs  that  best  approximate  the  Cauchy  pdfs,  and 
(b)  the  objective^unction  used  to  design  the  Gaussian  controller  to 
be  comparable  to  the  one  used  in  the  Cauchy  noise  setting.  Those 
two  items  are  addressed  first  in  this  appendix,  which  concludes 
with  the  solution  to  the  Gaussian  controller  problem. 

B.l.  Normal  pdf  least  squares  fit  of  a  Cauchy  pdf 

To  construct  a  normal  or  Gaussian  pdf  that  best  fits  a  given 
Cauchy  pdf,  the  following  optimization  problem  is  solved 
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where  the  Cauchy  pdf  is 
c  P/n 
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jj  >  0 


and  the  normal  pdf  is  given  by 
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Solving  the  integral  in  (B.  1 )  analytically  and  equating  its  derivative 

with  respect  to  E,  to  zero  yields  the  nontjnear  equation 
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(B.4) 


where  K  =  /J/^.  This  equation  clearly  indicates  that  the  optimal  E, 
is  proportional  to  /J,  i.e. ,  Z;*  =  K0/J,  where  Ko  is  the  solution  of  (B.4). 

The  latter  can  be  solved  only  numerically  to  yield  K0  ~  1 . 3898. 
Consequently,  the  equivalent  corresponding  pdf’s  for  the 

Gaussian  optimal  control  problem  are  chosen  as  A 
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B.2.  Linear-exponential-Gaussian  (LEG)  m-step  optimal  model  pre¬ 
dictive  controller 

To  best  approximate  the  Cauchy  objective  function  of  (6)  in 
the  Gaussian  case,  an  exponential  objective  function  is  chosen. 
Since  only  the  dynamic  programming  solution  to  the  LEG  problem 
available  (Speyer  &  Chung,  2008,  Chapter  10),  the  m-step  optimal 
model  predictive  LEG  controller  is  derived  here. 
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Similarly  to  fitting  the  pdf’s  discussed  above,  the  parameters  in  the 
Gaussian  objective  function  can  be  chosen  to  best  fit  the  Cauchy 
objective  function  as 

1  1 

cjp  —  —  r  Vi  —  — j— — /  i  —  k,  ...  ,p  1.  (B.7) 
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K272 
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V  ° 

To  solve  the  maximization  problem  in  (B.6),  Zt  could  be  decom¬ 
posed  as  in  Section  5.  However  in  this  decomposition  does  not  sim¬ 
plify  the  derivation  as  was  the  case  when  determining  the  Cauchy 
controller,  and  hence  is  not  performed  here. 

To  maximize  (B.6)  it  is  sufficient  to  maximize  /Gz-  ■  fjiven  by 
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The  conditional  density  in  (B.8)  is  the  Gaussian  conditional  density 
of  the  state  at  time  p  given  all  measurements  and  controls  up  to 
time  k  and  is  qiven  by 

fxp\CJk  Xp\Z.k  I — I 
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1  (xp-xpj 


(B.9) 


p  of  a  Kalman  filter,  generated  from  the  update  equations  (Speyer 
&  Chung,  2008,  Chapter  9) 


VMjc 
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1  y  +  H2Mk 

and  propagated  to  time  p  from  time  k  using 
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Using  (B.9)  in  (B.8)  and  solving  the  integral  gives 
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The  control  input  sequence  that  maximizes  this  objective  function 
is  found  analytically.  The  control  input  applied  at  time  k  is  the  first 
element  of  the  optimal  control  sequence  u,  found  in  (B.12),  as 


.  *  in'-1 

(QS  S  +  R)  S  Q0  x  k, 


where 
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and  R  is  a  diagonal  matrix  with  n,  i  =  m  along  its  diagonal. 

For  the  inverse  in  (B.12)  to  exist,  it  is  sufficient  that  qp  >  0  and 
that  the  matrix  R  be  positive  definite,  which  means  that  r,  >  0  V i. 
Note  that  in  the  Gaussian  case  the  multiplicative  form  of  the  objec¬ 
tive  function  allows  a  very  simple  analytical  form  of  the  optimal 
control.  The  same  is  true  for  the  dynamic  programming  solution, 
where  the  objective  function  composed  of  sums  of  exponentials 
appears  intractable. 
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Abstract — An  optimal  predictive  controller  for  linear,  vector- 
state  dynamic  systems  driven  by  Cauchy  measurement  and 
process  noises  is  developed.  For  the  vector-state  system,  the 
probability  distribution  function  (pdf)  of  the  state  conditioned 
on  the  measurement  history  cannot  be  generated.  However,  the 
characteristic  function  of  this  pdf  can  be  expressed  in  an  analytic 
form.  Consequently,  the  performance  index  is  evaluated  in  the 
spectral  domain  using  this  characteristic  function.  By  using  an 
objective  function  that  is  a  product  of  functions  resembling 
Cauchy  pdfs,  the  conditional  performance  index  is  obtained 
analytically  in  closed  form  by  using  Parseval’s  equation  and 
integrating  over  the  spectral  vector.  This  forms  a  non-convex 
function  of  the  control  signal,  and  must  be  optimized  numerically 
at  each  time  step.  A  two-state  example  is  used  to  expose  the 
interesting  robustness  characteristics  of  the  proposed  controller. 

1.  Introduction 

Models  in  modem  stochastic  optimal  control  algorithms  like 
the  linear  quadratic  Gaussian  (LQG)  and  the  linear  exponential 
Gaussian  (LEG)  assume  linear  dynamics  and  additive  process 
and  measurement  noises  described  by  the  Gaussian  probability 
density  function  (pdf).  The  Gaussian  distribution  function  has 
very  light  tails,  so  that  large  deviations  are  essentially  impos¬ 
sible.  Therefore,  the  LQG  and  LEG  algorithms  do  not  perform 
well  in  the  presence  of  heavy-tailed  or  impulsive  uncertainties. 
In  many  practical  applications,  such  as  radar  and  sonar  systems 
affected  by  atmospheric  and  underwater  acoustic  noises,  more 
impulsive  uncertainties  are  observed  [1],  Impulsive  behavior 
is  also  more  effective  at  modeling  adversarial  motion,  as  is 
air  turbulence,  which  is  better  described  by  distributions  with 
heavier  tails  than  the  Gaussian  [2], 

Therefore,  in  this  paper  we  propose  a  system  model  that 
assumes  linear  dynamics  driven  by  additive  process  and  mea¬ 
surement  noises  described  by  Cauchy  pdfs.  Both  the  Cauchy 
and  Gaussian  pdfs  belong  to  a  class  of  distributions  called 
the  symmetric  or-stable  (Sa-S)  class,  whose  members  are 
described  by  their  characteristic  functions.  A  full  treatment 
of  the  Sa-S  class  can  be  found  in  [3].  The  Cauchy  pdf  is 
in  a  subset  of  this  class  whose  members  have  infinite  second 
moments.  In  addition,  the  mean  of  the  Cauchy  pdf  is  not  well 
defined. 

Algorithms  for  optimal  estimation  and  control  of  scalar 
linear  systems  driven  by  Cauchy  distributed  process  and 

This  work  was  partially  supported  by  Air  Force  Office  of  Scientific 
Research,  Award  No.  FA9550-10-1-0570,  and  by  the  United  States  -  Israel 
Binational  Science  Foundation,  Grant  2008040. 


measurement  noises  have  been  developed  previously  in  [4,  5]. 
There,  the  conditional  performance  index  for  model  predic¬ 
tive  control  is  determined  directly  by  taking  the  conditional 
expectation  of  the  objective  function  using  the  probability 
density  given  the  measurement  history  as  presented  in  [4].  A 
dynamic  programming  algorithm  is  also  developed  in  [5].  It  is 
shown  that  the  solution  to  the  dynamic  programming  recursion 
is  intractable  because  of  the  need  to  average  over  future 
measurements  in  determining  the  optimal  return  function. 

In  this  paper,  the  Cauchy  optimal  control  algorithm  for 
scalar  systems  [5]  is  extended  to  systems  with  a  vector  state. 
For  the  vector  case,  the  conditional  pdf  (cpdf)  given  the  mea¬ 
surement  history  is  not  available.  However  the  characteristic 
function  of  the  cpdf  can  be  recursively  propagated  [6,  7].  The 
significant  contribution  in  this  paper  is  evaluating,  in  closed 
form,  the  conditional  performance  index  using  the  cpdfs 
characteristic  function  instead  of  the  cpdf  itself,  and  integrating 
over  the  spectral  variables  instead  of  the  state  variables. 

Although  the  cpdf  is  not  available  as  a  function  of  the  state 
vector,  the  conditional  expectation  of  the  objective  function, 
i.e.  the  conditional  performance  index,  can  be  computed  using 
the  characteristic  function  of  the  cpdf,  which  is  available  as 
a  function  of  the  spectral  vector  [7].  The  objective  function 
is  cast  as  a  product  of  functions  resembling  Cauchy  pdfs, 
which  are  easily  transformed  into  a  function  of  the  spectral 
variables.  Consequently,  the  conditional  performance  index  is 
found  in  a  closed  form.  Due  to  its  complexity,  the  optimal 
control  signal  is  determined  by  numerically  optimizing  this 
conditional  performance  index  in  a  model  predictive  control 
setting. 

The  remainder  of  the  paper  is  structured  as  follows.  The 
controlled  system  model  is  presented  in  Section  II.  An  ap¬ 
propriate,  computable  performance  index  for  this  problem  is 
presented  in  Section  III  and  subsequently  transformed  from 
the  state  variable  to  the  spectral  variable  form.  In  Section  IV 
the  spectral  integrations  required  to  determine  the  conditional 
performance  index  are  reduced  to  an  integral  formula  that  can 
be  evaluated  in  closed  form.  Section  V  addresses  a  special 
case  of  systems  with  two  states.  Here,  using  an  alternative, 
simplified  form  of  the  two  state  cpdfs  characteristic  function, 
the  conditional  performance  is  determined  in  closed  form.  In 
Section  VI  numerical  examples  are  given.  Conclusions  are 
given  in  Section  VII. 


11.  Description  of  the  Model 
This  paper  deals  with  a  discrete  time,  linear  system  de¬ 
scribed  by 

x(k  +  1)  =  Ox(k)  +  At j(k)  +  Vw(k) 

z(k)  =  Hx(k)  +  v(k)  (1) 

where  x[k)  £  Rn  is  the  state  vector,  u(k)  is  a  scalar  deter¬ 
ministic  input,  z[k)  is  a  scalar  measurement,  and  w(k)  and 
v(k)  are  scalar  independent  Cauchy  distributed  process  and 

measurement  noise  inputs  with  medians  at  zero  and  scaling 
parameters  of  (3  and  y.  respectively,  so  that  their  pdfs  are 
given  by 

fwk  w(k)  —  w2(/t)  +  p2  ’  tv*  v(k)  —  v2(k)  +  y2  ■  (2) 

The  characteristic  functions  of  these  pdfs  are 

<Pwk(o)  =  cpvk(o)  =  e~Y^,  (3) 


where  CT  is  the  scalar  spectral  variable. 

The  initial  conditions  are  assumed  to  be  independent 
Cauchy  distributed  random  variables  with  the  pdfs 

n  a/ tt 

fx b(x(0))=  (0))2  +  a2-  (4a) 

/=  i  ' 

Its  characteristic  function  is  given  by 

n 


<px„(  v)  = 


n 

/=  i 


Gf,  V,+jXi(0)V, 
e  l  l 


(4b) 


where  Vi  is  an  element  of  v  e  Rn. 

The  stochastic  system  (1)  can  be  decomposed  into  two 
systems,  one  driven  by  u(k)  and  one  by  w(k),  by  exploiting 
the  linearity  of  the  system.  Let  x[k)  and  z[k)  be  the  part  of  the 
system  driven  by  the  control  u[k)  only,  and  x(k)  and  z[k)  be 
the  part  of  the  system  driven  by  the  process  noise  w(  k)  only 
and  contains  all  the  underlying  random  variables.  Then, 

x(k)  =  x[k)  +  x(k)  (5a) 

z{k)  =  z(k)  +  z(k).  (5b) 


The  controlled  part  of  the  system  is  described  by 


x(k)  =  Ox[k  -  1)  +  Au(k  -  1)  (6a) 

z(k)  =  H  x(k)  (6b) 


with  initial  condition  x(0).  The  process  noise  driven  part  is 
given  by 

x(k)  =  0x(k  -  1)  +  Vw(k  -  1)  (7a) 

z(k)  =  H  x(k)  +  v(k).  (7b) 


The  process  and  measurements  noise  pdfs  were  defined  in  (2), 

while  the  initial  condition  of  this  stochastic  model  is  Cauchy 

distributed  with  a  pdf  given  by 

n 

^(x(0))=  "~x2a/TT  2-  (8a) 

/=-!  /(0)  +  OT/ 

Its  characteristic  function  is 

ri  -or;  I V,  | 

0X,(  V)  =  e  ■  (8b> 

/=  i 


The  above  decomposition  will  be  used  to  derive  the  Cauchy 
controller. 

Let  the  state,  measurement,  and  control  histories  used  in  the 
control  problem  formulation  be  defined  as 

Xr  ■  =  {x(R), . . . ,x(m)},  (9a) 

Zk  :=  {z(0),...,z(k)},  (9b) 

URm  ,  Ur  Ef  (9C) 

where  F  is  the  class  of  piecewise  continuous  hinctions  adapted 
to  the  CT-algebra  Ok  generated  by  the  measurement  history,  i.e. 
the  control  is  a  random  variable  that  is  measurable  with  respect 
to  events  in  Ok  [8].  Moreover,  in  [9]  it  is  shown  that  Uk  is 
adapted  to  the  CT-algebra  bk  generated  by  ^  ,  which  means 
that  the  control  is  measurable  on  events  generated  by  only. 

III.  Derivation  of  tije  Cost  Using  Characteristic 
unctions 

Our  proposed  controller  is  an  m-step  model  predictive 
controller  [10]  that  uses  current  and  past  measurements,  and 
averages  over  future  process  noise.  At  each  time  step,  the 

conditional  performance  index  is  computed.  Since  the  perfor¬ 
mance  index  will  be  shown  to  be  a  nonconvex  function  of 

the  control  sequence,  it  is  maximized  numerically.  Once  the 
optimal  control  sequence  of  length  m  is  computed,  only  the 
first  control  in  that  sequence  is  applied.  At  the  next  step,  a  new 


measurement  is  taken  and  the  process  is  repeated,  producing  a 
new  optimal  control  sequence  and  applying  only  the  first  one. 
In  this  paper,  we  study  the  optimal  stochastic  state  regulation 
problem,  noting  that  the  tracking  problem  can  be  handled  in 
a  similar  fashion.  Our  regulation  problem  will  have  a  finite 
horizon  of  length  m  such  that  the  terminal  state  occurs  at 
time-step  p  -  k  +  m. 

Similar  to  the  scalar  control  problem  presented  in  [5],  the 
control  objective  function  is  chosen  as  a  product  of  Cauchy- 
like  functions  given  by 


<//  Xk+ 1 


\ 


um)  +  e. 


ra  nl+ir/TT 
x2(;  +  l)  +  n2 


(10) 


i=k 


r=  1 


/'+1,r 


Then,  the  the  performance  index  conditioned  on  the  current 
measurement  history  and  averaged  over  future  process  noises 
is  given  by 


j*  =  max 

k,p 


E  y  Xk+ 1 

U&~ 1  e  F 

=  max  E  E  ip  XPk+vu[  1 

or1  g  f 

=  E  max  E  ip  Xk+1,  Uk 


UP~'  G F 


E  J* 


,  (11) 


where  the  interchange  of  the  maximum  and  expectation  oper¬ 
ations  is  due  to  the  fundamental  theorem  in  [11]. 

We  are  now  concerned  with  determining  the  analytic  form 

for  the  conditional  performance  index  J-  Using  (10)  it 

Z-k  ^ 


becomes 

J~  =  E  Ifl  XP  ^yp-1 


zk 


| 

tn1 

n 

n 

E 

i=k 

■  r=1 

JOO 

P-1 

n 

n 

n 

i=k 

r=  1 

k+ 1  k 


li^.r/TT 

yz 


n.  „  /tt 

'/+1,r 


- O/TT  X~‘ 

2  2  2  Z/( 

U  (/)  +  <f(  \ 

_ 2 _ 2 


xr(i  +  1)  +  /7/+1  r 

u  (/)  +  <f, 

X  fw(x(p)  \x(p  -  1))  ■  %(x(/(  +  2)  |x(/c  +  1)) 

x  fxM\%ix(k  +  l)\Zk)dx-\(k+  l)...dxn(k+  1) 
x  dxi(/c  + 2)...  dXn[k  +  2) . . .  dxi(p) . . .  dx„(p)  (12) 

For  now,  let  us  only  consider  weighting  on  the  terminal  state, 
x(p),  and  on  the  m  scalar  control  inputs.  The  control  weighting 


functions  Mu 


'p-  1 


Zi/TT 


can  come  out  of  the  integral. 


i=k  u2(i)+cj 

Ji^t]*Tt«S,tfeft/Qiie^i-1Ilsi4ad1f&eilff8gpSliiinobl?)o^^ 


{xi  [p), . . .  ,xn[p)}.  Thus,  for  notational  convenience  we  can 
drop  the  time-step  index  in  this  integral  and  write  it  over 
{Xi,...,Xn}  as  \ 

r°°  1  m 


J~  =  Mu 

zk 


q  /tt 

P-r _ 


^ 1  Xr  +  Xr  2  +  q2pr 


m+i/TT 

(14a) 

r=- 1  Xr  +  Xr  +  nf+^,r 

ri  11- 

e~nP,rWr  1  -7Xr 

(14b) 

1=1 

1  n 

Mu  n 

=  (2rr)n  _  r=  i 


\ 


e~np.r  Wr\  +jxr(p)Vr 


IV.  The  Conditional  Performance  Index 
Consider  the  integral  over  vn  in  (15), 


ln  = 


foo  I  n  \ 

n  nP,r  Er,V  I  jx(p). 


r=  1  e 


I 


<?Xp  |Z/c 


(v)dVn 


—  oo 

r 


00  _  Hp,r\  Er.V  |  +j  x(p),V 

e  r=  1  (p^\;Jv)dvn  (16) 


The  cpdf  for  the  state  x  ( /c)  is  denoted  as  Ly-  17, .  The  unnor- 
malized  cpdf  (ucpdf)  is  denoted  as  1  -  -  —  ~  ■  f~ 

where/7  is  the  pdf  of  the  xk\zk  k\zk  zk 

,  ~  measurement  history  and  has 

a  kno  zk 

alue.  In  [12, 13],  the  characteristic  function  of 
wn  v- 

the  ucpdf  cp  ~  (v)  is  recursively  propagated;  the  charac- 
xk  \zk 

Rustic  function  of  the  normalized  cpdf  is  (pyk \Zk(v)  = 

xk\zSv)/fzk -  where  fZk  =  (Pxk \~  (v)  U=o.  From  [12,13] 

Zk 

the  form  of  the  characteristic  function  of  the  ucpdf  at  time  k 
is  shown  to  1^_  _  nt  k 


YH/i/v) 


k\k 

xk\ zk=  g i  ( ygi(v))e 

/■=  1 


where  xu*  1  \ 

yk'k(v)=  q^sgn  a*1*,  v  e  Rk 
gi  i,R 

R=1 


iR 


(17a) 

(17b) 


X  fxp\Zk(x\Zk)d~x^  ...dxn.  (13) 

The  cpdf  fx  \z(x  | Zk)  can  be  evaluated  in  closed  form  for 
scalar  systems  [4].  However,  for  vector  state  systems  it  is  the 
characteristic  function  of  the  cpdf,  <Pxp\z  ( v)  ■  that  is  evaluated 

spectral  variable  v  instead  of  the  pdf  variable  x. 

Define  the  product  over  r  in  the  integral  in  (13)  as  Rx  and 
its  Fourier  transform  as  Lx, 


Rx(x  +x)  = 


(v)  = 


j<i«  1 

_  _X  k\k 


\  1  \ 


Pi,R  aftk,  v  +J  b,klk,v  (17c) 


R=1 


,  .  „k\k  k\k  k\k  k\k  k\k  ,k\k 

and  the  parameters  n  ,  n  '  ,  q  ,  p  ,  a  ,  P  are 

t  e,i  iR  iR  iR  i 
generated  sequentially  from  k  =  0. 

For  the  MPC  algorithm,  the  characteristic  function  of  the 

Xpiz^^)  is 

(v)  —  cp~  ~(®mTv)<pw((®m-'V)Tv) 


<P 


XP \Zk  Xk  \Zk 


X  ■ 


■  x  (pw((®r)TV)(pw(\-V) 


./I 
/=  1  ( 

v  m- 1 

x  exp  —  /?  0  V,v 


■p\((t>r,v)\-p\(r,v)\ 


Using  these  definitions,  we  can  apply  Parseval’s  equation  over 
each  variable  in  (13)  to  express  the  conditional  performance 
index  as  an  integral  over  the  spectral  variable  V, 

M  p° 

J~  =  u  L*(v)  ■  ( v )  dv  ...dv 

7  ,  (Px  Z 

Zk  (Z/Tp7  p\\  1 


(18) 

In  (18)  we  add  m  terms  to  the  sum  in  y^v)  of  (17c). 
By  combining  the  exponent  in  (16)  with  that  in  (18),  the 
combined  exponent  in  the  integrand  of  (16)  has  in  total  of 
nk^k  +  m  +  n  real  terms,  and  the  imaginary  part  is  composed 
of  two  components.  Define  the  following  terms 

P«  =  P«.  a?i‘  =  <Ki‘  for  R  =  1, . . . , 


PiR 


x<P%,\a(v)dVi...dv„,  (15) 
where  L*  is  the  complex  conjugate  of  Lx.  The  next  section 


PiR  =  PpR,  gk\k  _ 


=R-(-nH,k+ 1)  (19a) 

kjk 


IjR  ~  r 


and 


]jk\ k  k\k 


fnr  R=1,.,ne, 
Ior  „  .  k/k 

r=R-(nei  +m) 

k\k  k\k 


shows  how  to  evaluate  these  n  nested  integrals  sequentially  in  =  ^,  +  Xp,  nej  =  fa,  +  m  +  n^k 

closed  form.  Q;R  =  0  for  R  =  nej  +  1, . . . ,  hej 


(19b) 


Using  these  definitions,  the  integrand  in  (16)  becomes 

v(v)  =  f  tp,(v)  =  gklk(yklk(v))  ■  ey*lkv) 

>  gi 

/=  i  /=  i 

where 

-M/c  "  *1*  1  \ 

gi  (v)  =  Q/r  sgn  a(R  ,  1/ 

R=  1 

^  1  \  1  \ 

- R|/c  Ar|R  R|R  +  j  ~  k\k 

yeiM=  Pir  aiR  ,  v  v  blR  ,  v 

R=  1 

The  integration  in  (16)  is  performed  for  e^ch  element  of  v 
in  turn.  Beginning  with  Vn ,  decompose  V  =  v  -]  where 

vn 

k/k 

|-00  I  poo 

In  =  ip(v)dv=  I”  °°\  lfJi(  v)dv 


(20a) 


(20b) 


(20c) 


v  £  Rn  1 .  Then, 


—  oo  — oo  — oo  — oo 

oo  klk  | — | 

r 

=  ■■■  c  yjj{vn,  v)dvn^  dv  (21) 

/=  1 

—  CO  — oo  — oo 

The  objective  is  to  reduce  the  inner  integral  in  (21)  to  a  form 
that  is  obtained  in  closed  form  using  the  integral  formula 
developed  in  [12, 13].  First,  since  akRk  multiplies  v  in  the  sign 
function  in  (20b)  and  the  absolute  value  filiation  in  (20c), 

they  are  decomposed  as  ak^k  =  ak]>k  ~k\k  ,  where  ak^k  is 
iR  iR  aiR  iR 

a  scalar  and  ak^k  £  Rn~\  Therefore,  the  inner  products  in 
(20b)  and  (20c)  become 

1ai$k,vK=  akR]k,v  -  -akRvn  ■  (22) 

In  order  to  rewrite  (22djn  a  form  consistent  with  the  integral 
formula  in  [12, 13],  —  a  K  is  divided  out  of  the  second  term. 

If  ak^  =  0,  then  the  term  e  loses  dependence  on  v n 

We4tao%e  'GflfiEiifWUfl  (2i)- bTeherefore’ 


as 


iR 


-ak\k 


iR 


,v  =  a 


k\k 

iR 


sgn 
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k\k 

PiR  ~Vn 


(23a) 


where  uk^k  =  ~^k  ,  v  .  Therefore,  the  elements  in  (20b) 

iR  _ klk  V 

and  (20c)  are 

qk\k 
iR  sgn 

Pkok 
yiR 


these  ddfinitiori?,  the  inneiP  integral  d?  (21)  of  tfte  form 

fOO 


£iR 

1  \ 

_  k\k 
aiR  ,  V 

1  A 

ak\k,  V 
aiR 

k\k  k\k 

=  QiR  sgn  PiR  ~  vn 

(23b) 

=  pk \k  k\k  ~  Vn 

iR  PiR 

(23c) 

,k\k  ~ 

sgn  —a 

Pk  and  pklk  =  pklk  ak'k  . 

.  Using 

tpi(,v)dvn  =  e 

□ 

x  exp 


j~bklk,v 


\r°°  □ 

gklkD  ^Rssn 

i  R=  1 

□ 


0- 


nW 

pk\k  /Jk\k  ~ Vn  +ibk\kVnn  dvn. 

iR  iR  i 


□ 


(24) 


R=1 


The  convolution  integral  in  (24)  is  shown  in  [12, 1 3^ ^  to  have 

a  closed  form  solution  composed  as  a  sum  with  hjR  terms, 
each  of  which  is  structurally  similar  to  the  terms  in  yJi(v). 
That  is,  there  will  be  a  new  g  function  which  is  a  function  of 
signs  of  inner  products  of  v. 

Therefore,  this  integration  process  can  be  repeated  until  all 
of  the  integrals  are  taken,  and  a  closed  form  solution  of  the 
conditional  performance  index  is  determined.  This  pattern  will 
be  seen  in  the  next  section,  where  the  conditional  performance 
index  for  the  two  state  system  is  explicitly  obtained. 

V.  The  Conditional  Performance  Index  for  a 
Second  Order  System 

Now,  let  us  limit  our  discussion  to  a  second  order  system  in 
order  to  use  the  structure  for  the  cpdf  s  characteristic  function 
presented  in  [7].  We  use  this  alternate,  two  state  structure  for 
the  characteristic  function  of  the  cpdf  in  order  to  make  the 


subsequent  derivations  and  computation  more  tractable.  This 
is  due  to  a  simpler  structure  for  the  exponential  argument  in 


(17c),  which  produces  fewer  terms  in  the  smn,  as  well  as  a 
simpler,  closed  form  representation  for  the  g  coefficients  in 

(17a). 

The  structure  for  the  characteristic  function  for  the  ucpdf  is 
given  by  ^ 

~(v)=  Gj(  v)  X 

xp\Zk 

/=  i 


exp 


□ 

□  _ 


R=1  P, 
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b:k,v  +j 


k,i 


R=1 


ZkPk  R  v 


(25) 


which  consists  of  a  sum  of  Nk  similar  terms.  Each  of  these 
terms  has  a  coefficient  G/(\/)  and  an  exponential  whose  argu¬ 
ment  involves  a  sum  of  absolute  values  equivalent  to  (17c).  In 
the  ( 
terms; 


:  exponential  argument:  there  is  a  sum  of  L'  absolute  value 
ns;  the  PH  and  ZH  values  represent  real,  Scalar  constants 


where  P h 


k,i 


k,i 

MKk. 

k,i 


kj  >  0;  the  B  s  are  1  x  2  row  vectors,  called  the 

fundamental  directions;  and  the  MPk,i  represent  integers  that 
index  the  fundamental  directions  that  multiply  v. 

All  of  these  parameters  correspond  with  those  of  the  struc¬ 
ture  for  the  gpdf  s  characteristic  function  presented  in  ^ction 


IV.  The  P, 

bklL 


k,i 


and  Zkj  parameters  correspond  to JD.jR  and 

2  TCspectiveW  and  the  fundamental  directions  Bk  are  the 
same  as  the  a  1  vectors. 

The  coefficients  G;( v)  equal  the  gk^k  from  (17a),  and  are 
rational  polynomial  of  sums  of  sign  functions,  given  by 

1  f 
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r=i  j8k_R  +  y  +  Sk_R(Bkv) 
j9  y  + 
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k-R 


B k-R  Bk 


(26a) 


where 


Lt-R-2 


Sk-R(BkV)  =  P,R  sgn  E^'v  ,  (26b) 

n=  1 

0/c'-R  =  Zk-R  ~  Z-khkL$\  T  _  ZkB^RW  T.  (26c) 

The  arguments  y^^v)  of  gk^k,  given  in  (17b),  correspond  to 
gi  i 

the  Sk_R(Bkv)  defined  above. 

Next,  the  transformed  objective  function  in  the  performance 
index  is  given  by 

L*{p)(  v)  =  e~nPA  I  vi  I  +jx i  i/i  -  nP 2  I  v2 1  +7x21/2  (27) 

where,  since  k  will  be  a  constant  through  this  process,  the 
time  subscript  ofx(p)  is  replaced  with  element  subscripts  as 

x(p)  =  [xi  x2].  The  2-state  specific  version  of  (15)  then 
becomes 

Mu  ^ 
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+7  ZR  B  kj  v  dv.  (28) 
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R=  1 


The  integral  with  respect  to  1/2  is  now  taken.  We  use  a  second 
subscript  to  denote  the  individual  elements  of  the  rows  of  Bk 

as  6^'  =  B^j  .  This  allows  us  to  decompose  the 

complex  part  of  the  exponential  argument  as 

X  I  \ 

J  ZR  BMl  „ 


k,i  k 
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=j  1/1+7  Z^B™*'-  1/2,  (29) 
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and  rewrite  J-  as 
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Denote  the  inner  integral  with  respect  to  1/2  in  (30)  as  l2 : 


|00 

/2  =  Gj(  1/1, 1/2) 
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The  integral  /2  is  over  the  1/2  variable,  but  it  also  contains  1/1 
in  the  absolute  value  terms  in  the  argument  of  the  exponential. 
In  order  to  solve  it,  we  need  to  use  the  integral  of  absolute 
values  method  presented  in  [7],  which  requires  writing  the 
integral  injhe  form  pven  in 

I2  =  G/u  PRSgn(pR-a)- 
R=1 

□  □ 

x  exp  -  -  k+pR\pR-o\  +  jZkd  do,  (32) 

R=1 

as  was  done  in  obtaining  (24).  That  method  involves  defining 
a  set  of  scalar  constants  pR  and  ,  as  well  as  scalar  variables 

Pr  that  depend  on  1/1 ,  and  thus  are  constants  in  thi^  integration. 

wcr,ill)i§6te'/iHfUi9f1i(y/(5assume that al1  the Bk2  /=  h hen. 

set  that  transforms  /  into  the 

u+ 1 

R  1  2 

integral  of  absolute  values  structure.  This  assumption  does  not 

mr 

affect  generality,  because  if  one  of  the  Bk  '2  does  equal  zero 
for  some  /,  then  it  would  multiply  the  V2  variable  by  0  and 
thus,  that  absolute  value  term  would  not  be  a  function  of  V2 
and  would  come  out  of  l2  and  integrated  later. 

The  set  of  variables,  denoted  {Pr}.  is  constructed  as 

Bmv'  +  BtU^  =  ~BmR  -B$v  1 
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where 
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R  =  L'k  +  1. 
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(33b) 

Similarly,  for  the  argument  of  the  exponential,  we  can  con¬ 
struct  a  set  of  {Pr}i  ,+i 

f  as 
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and  a  scalar  number  7/  as 
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The  solution  to  an  integral  of  an  exponent  of  absolute  values 

requires  dividing  the  domain  of  integration  into  regions  in 
which  the  integrand  is  continuous.  Since  G;(vj  is  piecewise- 

constant,  its  discontinuities  lie  on  the  boundaries  of  these 


regions,  and  hence  Gi(v)  is  treated  as  a  constant  in  each 
integral.  In  order  for  the  G;(  v)  coefficients  to  be  consistent 
with  the  form  in  (32),  use  the  tilde,  bar,  and  hat  substitutions 
similar  to  the  measurement  update  process  described  in  [7]  in 
order  to  write  G;  in  (31)  as  G,: 
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R=1 


The  sums  in  the  rational  polynomial  G/(  v)  contain  different  p 
constants  but  the  same  set  of  sgn ( /Jr  —  a).  Hence,  it  is  written 
as  G  for  shorthand. 

Then,  let  a  =  V 2  in  order  to  write  the  one-dimensional 
integral  in  (32).  The  solution  is  given  as  a  sum  of  L'k+  1 
terms  as 
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(35) 


This  complicated  looking  (35)  simplifies  readily  into  a  simple, 
double-sided  scalar  integral  over  t/i , 


/2(  1/1 )  =  lTa/,m  +  7  d/.msgn  ( i/i)>  ■  e-0™  ■  I V1 1  +j  . 

m=1 

(36) 

This  simplification  is  based  on  algebraic  relations  used  in 
the  estimator’s  measurement  update  process  in  [7],  as  well 
as  constants  Di.m,  D  i,m->  and  Di,m  defined  as 
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R=1,R/= 
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R=1,R/= 

m 

Z^+1 

Di,m  '  sgn (1/1)  =  pRSgnfpR—  pm)  (37c) 
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Denote  (he  oqjer  integral  with  respect  to  I/,  in  (30)  by 
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This  integral  has  a  form  identical  to  the  measurement  update 
process  for  a  scalar  system  [4].  Its  solution  is  given  by 
_ ^i. m  J  d  1  ,,,| _ 
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Finally,  the  conditional  performance  index  in  (30)  is  given  by 
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This  closed  form  conditional  performance  index  is 
non-convex  and  depends  on  the  control  input  sequence 
{u(k), . . . ,  u[p  —  1)}  in  a  complex  way;  specifically,  the 


parameters  a;,m,  d;,m,  and  Xi  depend  on  the  control  sequence. 
Thus,  we  maximize  (40)  numerically  using  the  accelerated 
gradient  search  method  [14].  The  optimization  is  done  in  two 
steps:  first,  the  global  optimum  of  the  double  sum  term  in  (40) 
without  the^ontrol  weighting  terms  is  optimized  with  respect 
to  Xi  X2  =  x(p);  then,  that  final  state  is  used  to  generate 
a  control  sequence  as  an  initial  guess  for  the  second,  local 
accelerated  gradient  search  optimization  step. 

VI.  Numerical  Examples 

Here,  we  present  two  sets  of  examples,  the  first  of  which 
shows  the  optimal  control  L/( 0)  *  versus  the  measurement  z(0) 
for  the  first  measurement  update  only,  and  the  second  set 
shows  two  multi-step  examples.  All  of  these  examples  use 
a  two-step  horizon,  i.e.  m  =  2,  so  that  there  exists  a  control 
sequence  that  can  drive  our  two-state  system  to  the  origin  over 
this  horizon  length.  However,  as  we  are  using  model  predictive 
control,  only  the  first  control  input  of  this  sequence  is  applied 
at  that  time  step. 

All  of  our  examples  compare  our  Cauchy  optimal  model 
predictive  controller  with  a  similar  LEG  model  predictive  op¬ 
timal  controller.  The  LEG  estimator  assumes  that  the  stochastic 
inputs  are  described  by  the  Gaussian  pdfs  that  are  closest,  in 
a  least  squared  sense,  to  the  given  Cauchy  pdfs;  and  the  LEG 
controller  assumes  that  its  objective  functions  of  the  state  and 
control  resemble  scaled  Gaussian  pdfs  that  are  closest,  in  the 
least-squared  sense,  to  the  scaled  Cauchy  pdfs  in  (10).  The 
LEG  controllers’  responses  are  shown  in  dashed  lines  in  the 
figures. 

The  first  set  of  examples  are  shown  in  Fig.  1.  These 
figures  show  the  applied  optimal  control  input  at  the  first  time 
step,  k  =  0,  given  the  first  measurement.  In  the  two  cases 
presented,  all  the  systems  parameters  are  the  same,  except  in 
Fig.  1(a)  y  >  ai  =  02  (i.e.  more  measurement  than  state 
uncertainty),  and  in  Fig.  1(b)  cri  =02  >  Y  (i.e.  more  state 
than  measurement  uncertainty). 

The  example  in  Fig.  1(a)  shows  that  the  Cauchy  controller 
reduces  its  control  effort  to  zero  as  the  measurement  deviations 
become  large.  This  is  in  contrast  to  the  LEG  controller,  which 
is  linear  and  thus  responds  strongly  to  large  measurement 
deviations.  This  behavior  in  the  Cauchy  controller  occurs 
when  the  measurement  uncertainty  is  larger  than  the  state 
uncertainty.  In  the  opposite  case  shown  in  Fig.  1(b),  the 
measurement  has  less  uncertainty  than  the  state.  In  this  case, 
the  Cauchy  controller’s  response  closely  matches  that  of  the 
LEG  in  a  neighborhood  of  the  origin,  and  in  fact  responds  even 
more  strongly  than  the  LEG  for  large  measurement  deviations. 

The  three  different  curves  in  both  of  these  figures  represent 
the  response  for  three  different  control  weights:  no  control 
weight,  Co  =  10,  and  Co  =  5.  As  expected,  heavier  control 
weights  (i.e.  smaller  C)  reduce  the  control  effort,  but  even 
without  any  control  weighting,  the  response  in  Fig.  1(a)  goes 
to  zero  for  large  measurement  deviations.  The  fact  that  this 
behaviour  is  seen  when  there  is  no  control  weighting  implies 
that  the  attenuation  of  the  control  signal  for  large  measurement 
deviations  is  due  to  the  cpdf  and  not  the  objective  function. 


The  complexity  of  evaluating  the  cost  grows  as  the  number 
of  terms  increases  across  time  steps,  as  indicated  in  (40).  For 
implementable  control,  this  growth  needs  to  be  arrested.  The 
full  information  characteristic  function  of  the  ucpdf  (25)  is  ap¬ 
proximated  by  a  characteristic  function  of  a  ucpdf  conditioned 
on  a  fixed  sliding  window  of  the  most  recent  measurements, 
taken  here  to  have  length  eight.  The  relative  error  in  the 
approximation  appears  to  be  10-6  or  smaller. 

The  second  set  of  examples  are  shown  in  Fig.  2.  All  of  the 
plots  show  the  state,  control,  and  noise  histories  for  the  given 
simulations.  The  difference  between  Fig.  2(a)  and  Fig.  2(b) 
is  that  the  process  noise  /3  and  the  measurement  noise  Y 
parameter  values  are  interchanged. 

It  is  interesting  to  compare  Fig.  2(a)  with  Fig.  2(b)  in 
light  of  Fig.  1.  In  Fig.  2(a),  there  is  more  uncertainty  in 
the  state  process  noise  than  in  the  measurement  noise.  When 
large  measurement  deviations  occur  (such  as  at  k  =  52), 
the  Cauchy  controller’s  effort  is  very  small.  In  contrast,  the 
LEG  controller  responds  with  a  large  control  effort  that  drives 
the  states  from  their  regulated  state  of  zero.  However,  when 
large  process  noise  inputs  occur,  the  state  deviates  and  the 
Cauchy  controller  applies  a  larger  control  effort  than  the  LEG, 
thus  regulating  the  state  more  effectively.  This  suggests  that 
when  the  measurement  noise  density  parameter  dominates  the 
process  noise  density  parameter  in  constructing  the  Cauchy 
controller,  the  effect  of  measurement  outliers  is  mitigated, 
while  still  responding  to  state  deviations  due  to  process  noise. 

On  the  other  hand,  in  Fig.  2(b)  there  is  more  uncertainty  in 
the  state  than  in  the  measurement,  and  the  Cauchy  controller 
behaves  very  much  like  the  linear  LEG  controller.  The  state 
trajectories  and  control  inputs  of  the  Cauchy  and  LEG  con¬ 
trollers  appear  equal,  but  actually  their  differences  are  much 
smaller  than  the  scale  of  the  axis  and  cannot  be  seen.  This 
suggests  that,  when  the  stochastic  parameters  allow  it,  the 
Cauchy  controller  follows  the  measurement  more.  It  responds 
in  a  more  linear  fashion  to  the  measurements,  imitating  the 
performance  of  the  LEG  controller  in  that  setting. 

This  behavior  is  seen  again  when  both  controllers  face 
Gaussian  noises,  as  in  Figure  2(c).  Here,  the  Cauchy  controller 
closely  follows  both  the  control  and  state  trajectories  of  the 
LEG,  which  is  the  true  optimal  solution.  Hence,  the  Cauchy  is 
robust  under  non-impulsive  noise  environments,  as  it  closely 
approximates  the  true  optimal  solution  given  by  the  LEG. 

VIE  Conclusions 

An  optimal  stochastic  controller  was  derived  for  vector- 
state,  linear,  discrete-time  systems  with  additive  process  and 
measurement  Cauchy  distributed  noises.  Since  the  Cauchy 
distribution  has  an  undefined  mean  and  an  infinite  second 
moment,  we  cannot  use  standard  objective  function,  e.g.,  the 
expected  value  of  a  quadratic  function  of  state  and  control 
variables.  Therefore,  a  new  and  computable  objective  function 
was  defined.  Opposed  to  previous  work,  the  characteristic 
function  of  the  cpdf  of  the  state  given  the  measurement 
history  and  the  Parseval’s  equation  are  used  to  express  the 


(a)  optimal  control  vs  the  measurement:  Q/  =  0-1,  Y  —  0-5. 


z(  0) 


(b)  optimal  control  vs  the  measurement:  Oz  =  0.5,  K  =  0-1- 

Fig.  1.  Parameters  used  are:  /7  =  [1  1],  P  =  0.02,  T  1  =  [1  1],  A1  =  [1  1], 
and  the  eigenvalues  of  d>  are  0.2455  ±  JO ■  1523. 

conditional  performance  index  in  a  closed  form.  This  closed- 
form  conditional  performance  index  is  optimized  numeri¬ 
cally  using  an  accelerated  gradient  search.  Examples  are 
presented  that  show  how  our  vector  state  Cauchy  controller 
compares  against  an  equivalent  LEG  controller,  demonstrating 
the  Cauchy  controller’s  performance  and  improved  robustness 
over  its  Gaussian  counterpart. 
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(c)  y  =  0.5,  f  =  0.1,  Gaussian  noises. 
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dashed  lines  are  LEG.  Simulation  parameters  are:  H  =  U  1],  O  =  [0.8  0.8], 
H  =  [1  1],  rT  =  [0.5  1],  At  =  [0.5  1],  m  =  2,  and  the  eigenvalues  of  ® 
are  0.8  ± 70.55.  There  is  no  control  weighting. 
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Abstract — An  optimal  predictive  controller  for  linear,  vector-state 
dynamic  systems  driven  by  Cauchy  measurement  and  process  noises  is 
developed.  For  the  vector-state  system,  the  probability  density  function 
(pdf)  of  the  state  conditioned  on  the  measurement  history  cannot  be 
generated.  However,  the  characteristic  function  (CF)  of  this  pdf  can  be 
expressed  in  an  analytic  form.  Consequently,  the  performance  index 
used  for  the  controller  design  is  evaluated  in  the  spectral  domain 
using  this  CF.  By  taking  the  conditional  expectation  of  an  objective 
function  that  is  a  product  of  functions  resembling  Cauchy  pdfs,  the 
conditional  performance  index  is  obtained  analytically  in  closed  form  by 
using  Parseval’s  identity  and  integrating  over  the  spectral  vector.  This 
forms  a  deterministic,  non-convex  function  of  the  control  signal  and  the 
measurement  history  that  must  be  optimized  numerically  at  each  time 
step.  A  two-state  example  is  used  to  expose  the  interesting  robustness 
characteristics  of  the  proposed  controller. 


I.  Introduction 

Control  of  dynamic  systems  in  real  world  applications,  from  en¬ 
gineering  and  science  to  economics  and  finance,  frequently  involves 
handling  uncertain,  stochastic  inputs.  These  uncertainties  affect  both 
the  actual  state  of  the  system  as  well  as  the  measurements  that 
the  controller  depends  on.  When  designing  controllers  for  stochastic 
systems,  it  is  often  assumed  that  the  uncertainties  are  described  by 
Gaussian  probability  density  functions  (pdf),  due  to  the  efficiency 
with  which  modern  methods  handle  them.  For  linear  systems,  al¬ 
gorithms  like  the  linear  quadratic  Gaussian  (LQG)  and  the  linear 
exponential  Gaussian  (LEG)  assume  linear  dynamics  and  additive 
process  and  measurement  noises  described  by  the  Gaussian  pdf  [1], 

In  many  applications  the  underlying  random  processes  have  an 
impulsive  character  producing  deviations  of  high  amplitude  and  small 
duration  much  more  often  than  the  Gaussian  assumption  permits 

[2] ,  Examples  of  such  processes  include  radar  and  sonar  noise 

[3]  and  disturbances  due  to  air  turbulence  [4],  Another  application 
is  adversarial  missile  guidance,  where  the  target  is  intelligent  and 
desires  to  evade  missile.  In  missile  guidance,  uncertainties  are  usually 
assumed  to  be  Gaussian.  However,  optimal  target  evasion  maneuvers 
involve  high  acceleration  rate  maneuvers,  i.e.  a  high  amplitude  and 
small  duration  input  [5].  Hence,  a  control  algorithm  assuming  that 
the  target  is  driven  by  a  light  tailed  distribution  would  not  capture 
this  behavior.  The  Gaussian  distribution  function  has  very  light  tails, 
so  that  large  deviations  are  extremely  unlikely.  Therefore,  the  LQG 
[HI)  and  LEG  (H oo)  algorithms  do  not  perform  well  in  the  presence 
of  heavy-tailed  or  impulsive  uncertainties. 

Impulsive  uncertainties,  like  those  mentioned  in  the  examples 
above,  are  better  described  by  heavy-tailed  distributions,  such  as 
the  symmetric  alpha-stable  (SaS)  distributions  [6],  These  distribu¬ 
tions  are  described  not  by  their  probability  density  functions  (pdfs), 
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but  by  their  characteristic  functions  (CFs).  They  are  of  the  fonn 
cp[v)  =  e~aa \v\a+Jfv ,  where  O  is  the  scaling  parameter,  fJ  is  the 
median,  v  is  the  spectral  variable,  and  the  characteristic  exponent 
a  detennines  the  type  of  distribution:  a  =  2  implies  the  Gaussian 
distribution,  and  0=1  implies  the  Cauchy  distribution. 

Estimation  assuming  Cauchy  distributed  noises  has  shown  im¬ 
proved  perfonnance  over  Gaussian  estimators  when  faced  with  im¬ 
pulsive  noises.  For  estimating  the  direction  of  arrival  of  a  signal 
to  a  sensor  array  in  [7],  maximum  likelihood  estimators  designed 
assuming  Cauchy  distributed  noises  were  shown  to  exhibit  perfor¬ 
mance  very  close  to  the  Cramer-Rao  Bound  against  SoS  noises 
with  characteristic  exponents  1  <  a  <  2.  Similar  performance  was 
observed  in  various  applications,  including  processing  data  in  a  multi¬ 
user  communication  network  [8]  and  radar  glint  [9].  However,  these 
studies  are  only  for  estimation  of  the  signal,  or  signal  parameters, 
and  do  not  attempt  to  control  their  respective  systems.  This  paper 
derives  a  framework  for  control  of  multivariate  linear  systems  driven 
by  Cauchy  distributed  measurement  and  process  noises. 

Algorithms  for  optimal  estimation  and  control  of  scalar  linear  sys¬ 
tems  driven  by  Cauchy  distributed  process  and  measurement  noises 
have  been  developed  in  [10, 11].  There,  the  conditional  performance 
index  for  model  predictive  control  is  determined  by  taking  the 
conditional  expectation  of  the  objective  function  using  the  probability 
density  given  the  measurement  history  as  presented  in  [11,12],  A 
dynamic  programming  algorithm  is  also  developed  in  [12],  It  is 
shown  that  the  solution  to  the  dynamic  programming  recursion  is 
intractable  because  of  the  need  to  average  over  future  measurements 
in  determining  the  optimal  return  function.  This  cannot  be  done 
in  closed  form  due  to  the  complex  dependency  of  the  optimal 
return  function  on  the  measurement  history.  Hence,  the  dynamic 
programming  solution  was  approximated  using  the  model  predictive 
control  method. 

In  this  paper,  the  Cauchy  optimal  control  algorithm  for  scalar 
systems  [12]  is  extended  to  systems  with  a  vector  state.  For  the  vector 
case,  the  conditional  pdf  (cpdf)  given  the  measurement  history  is  not 
available.  However,  the  CF  of  the  cpdf  can  be  recursively  propagated 
for  such  vector  state  systems  [13-17].  In  particular,  [16, 17]  presents 
an  efficient  algorithm  for  the  two-state  Cauchy  estimator.  Although 
this  control  methodology  can  be  applied  to  general  vector-state 
systems  in  [13,14],  the  closed  fonn  expression  for  the  conditional 
perfonnance  index  used  in  the  control  problem  will  be  based  on  the 
structure  presented  in  [16, 17]. 

The  significant  contribution  of  this  paper  is  evaluating  in  closed 
form  the  conditional  performance  index  using  the  cpdf  s  CF  instead 
of  the  cpdf  as  in  [12],  Preliminary  results  can  be  found  in  [18], 
The  objective  function  is  cast  as  a  product  of  functions  resembling 
Cauchy  pdfs,  which  are  easily  transformed  into  functions  of  the 
spectral  variables.  Consequently,  the  conditional  performance  index, 
found  in  a  closed  form,  is  a  deterministic  function  of  the  control  and 
measurement  histories.  Due  to  its  complexity,  the  optimal  control 
signal  is  determined  by  numerically  optimizing  this  non-convex 
conditional  performance  index  in  a  model  predictive  control  setting. 

The  remainder  of  the  paper  is  structured  as  follows.  The  controlled 
system  model  is  presented  in  Section  II.  An  appropriate,  computable 
performance  index  for  this  problem  is  presented  in  Section  III  and 
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subsequently  transformed  from  the  state  variable  to  the  spectral 
variable  form.  In  Section  IV  the  spectral  integrations  required  to 
determine  the  conditional  performance  index  are  reduced  to  an 
integral  formula  that  can  be  evaluated  in  closed  form.  Section  V 
addresses  a  special  case  of  systems  with  two  states,  using  the  efficient 
algorithm  presented  in  [17].  Here,  using  an  alternative,  simplified 
fonn  of  the  two  state  cpdfs  CF,  the  conditional  perfonnance  is 
determined  in  closed  form.  In  Section  VI  numerical  examples  are 
given,  demonstrating  the  performance  of  the  Cauchy  controller  under 
both  Cauchy  and  Gaussian  noise  conditions.  Conclusions  are  given 
in  Section  VII. 


Its  CF  is 

n 

(Pxt(v)  =  'r>  e~a!'V!'.  (8b) 

/=  l 

The  above  decomposition  will  be  used  to  derive  the  Cauchy  con¬ 
troller. 


III.  Derivation  of  the  Cost  Using  Characteristic 
Functions 


(1) 


II.  Description  of  the  Model 
This  paper  deals  with  a  discrete  time,  linear  system  described  by 
x(k  +  1)  =  <t>x(k)  +  Au(k)  +  r  w(k) 
z(k)  =  H  x(k)  +  v(k) 
where  x[k)  e  R"  is  the  state  vector,  u[k)  is  a  scalar  input,  z(k) 

is  a  scalar  measurement,  and  w(k)  and  v[k)  are  scalar  independent 
Cauchy  distributed  process  and  measurement  noise  inputs  with  me¬ 
dians  at  zero  and  scaling  parameters  of  jS  and  y,  respectively,  so  that 
their  pdfs  are  given  by 

fw  W(k)  =  J/TT  fv  v(k)  =  o  Y/TT  (2) 


v\r(k)  +  p2  \f(k)  +  Yz 

The  CFs  of  these  pdfs  are 

cpw(o)  =  cpv(o)  =  e~vWl, 


(3) 


where  CT  is  the  scalar  spectral  variable. 

The  initial  conditions  are  assumed  to  be  independent  Cauchy 
distributed  random  variables  with  the  pdfs 


fxAxl  D)  = 


/=! 


a/rr 

(X/(1)  —  x,(l))2  +  or? 


(4a) 


Its  CF  is  given  by 

CpxAv)  =  n  e~ 


i=  1  ai  v, +jxj(l)Vi  (4J,) 

where  v,  is  an  element  of  v  €  Rn. 

The  stochastic  system  (1)  can  be  decomposed  into  two  systems, 
one  driven  by  u[k)  and  one  by  w(k),  by  exploiting  the  linearity 
of  the  system.  Let  x[k)  and  z[k)  be  the  part  of  the  system  driven 
by  the  input  u(k)  only,  and  x[k)  and  z(k)  be  the  part  of  the  system 
driven  by  the  process  noise  w(k)  only  and  contains  all  the  underlying 
random  variables.  Then, 

x(k)  =  x(k)  +x(k)  (5a) 

z(k)  =  z(k)  +  z(k).  (5b) 


Our  proposed  controller  is  an  m-step  model  predictive  controller 
[19]  that  uses  current  and  past  measurements,  and  averages  over 
future  process  noise.  At  each  time  step,  the  conditional  performance 
index  is  computed.  Since  the  performance  index  was  found  to  be 
a  non-convex  function  of  the  control  sequence,  it  is  maximized 
numerically.  Once  the  optimal  control  sequence  of  length  m  is 
computed,  only  the  first  control  in  that  sequence  is  applied.  At  the 
next  step,  a  new  measurement  is  taken  and  the  process  is  repeated, 
producing  a  new  optimal  control  sequence  and  applying  only  the  first 
one.  In  this  paper,  we  study  the  optimal  stochastic  state  regulation 
problem,  noting  that  the  tracking  problem  can  be  handled  in  a  similar 
fashion.  Our  regulation  problem  will  have  a  finite  horizon  of  length 
m  such  that  the  terminal  state  occurs  at  time-step  p  =  k  +  m. 

Let  the  state,  measurement,  and  control  histories  used  in  the  control 
problem  formulation  be  defined  as 


Xpj  :=  {x(k+  l),...,x(p)},  (9a) 

Zk  :=  {z(Y), . .  .,z(k)},  (9b) 

Uk  :=  {u(k), . u(p  —  1)},  Uk  e  F,  (9c) 


where 


F 


is  the  class  of  piecewise  continuous  functions  adapted  to  the 


CT-algebra  Ok  generated  by  the  measurement  history,  i.e.  the  control 
is  a  random  variable  that  is  measurable  with  respect  to  events  in  Ok 


[20].  Moreover,  in  [12]  it  is  proven  that  u  is  adapted  to  the  CT-algebra 

k 

Ok  generated  by  which  means  that  the  control  is  measurable  on 
events  generated  by  zk  only. 

Similar  to  the  scalar  control  problem  presented  in  [11, 12],  the  con¬ 
trol  objective  function  is  chosen  as  a  product  of  functions  resembling 
Cauchy  pdfs,  given  by 
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k+l>  UkP 
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n  /tt 


u2(i\  +  C  x2(i  +'!)'+  n2  ■  (10) 

i—k  i  r=l  r  i+l,r 


The  controlled  part  of  the  system  is  described  by 

x[k  +  1)  =  Q>x(k)  +  l\u[k )  (6a) 

z(k)  =  H  x(k)  (6b) 

with  initial  condition  x(  1),  i.e.,  the  median  of  (4a).  The  process  noise 
driven  part  is  given  by 

x(k  +  1)  =  Ox(k)  +  T  w(k)  (7a) 

z(k)  =  H  x(k)  +  v(k).  (7b) 


The  process  and  measurements  noise  pdfs  were  defined  in  (2),  while 

the  initial  condition  of  this  stochastic  model  is  Cauchy  distributed 
with  a  pdf  given  by 


(8a) 


Then,  the  the  performance  index  conditioned  on  the  current  mea¬ 
surement  history  and  averaged  over  future  process  noises  is  given  by 
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where  the  interchange  of  the  maximum  and  expectation  operations  is 
due  to  the  fundamental  theorem  [1], 


fx, (*(!))=  ,n'  a/TT 


We  are  now  concerned  with  determining  an  analytic  form  for  the 


i=i  /(D  +  cr,- 
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conditional  performance  index  J  ,  where  using  (10)  it  becomes 
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x  fxkll\zjx(k+  1)  \Zk)dx\(k  +  1) . . . dxn{k  +  1) 
x  dx\(k  +  2) . . .  dxn(k  +  2) . . .  dxAp) . . .  dxn{p).  (12) 


For  presentation  simplicity,  in  this  derivation  we  will  only  consider 

weighting,  on  the  terminal,  state,  J<(p).,  and  .cm/  thEirfHi  scalar  control 
inputs.  The  control  weighting  functions  Er  oat  can 

i=k  u—  ( i)  <f' 

come  out  of  the  integral  in  (12).  Then,  the  product  over  /  inside  the 
integrand  has  only  the  term  for  /  =  p  —  1 .  For  notational  convenience 

we  can  drop  the  time-step  index  in  this  integral  and  write  it  over 


{Xu- 

.  .  ,  Xn}  as 

J 

=  miM  "  np/TT  +  -  __  \)d~Xx. 

.  .  dxn 

Zk 

r=  1  Xr  +  Xr  1  +  Hp.r  XP|Z*^  ^ 
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=  Mu  RAx  +  *)■/>  ~  (x\~k)dxi...dxn.  (13) 

p\zk  z 


The  cpdf  fxp  \  ~  (x  \  k)  can  be  evaluated  in  closed  form  for  scalar 
**  * 

systems  [10],  However,  for  vector  state  systems  only  the  CF  of 
the  cpdf  in  (13),  cpp\k{v),  can  be  evaluated  in  closed  form  [13]. 
Therefore,  when  computing  the  conditional  performance  index  we 
need  to  express  the  integral  using  the  spectral  variable  v  instead  of 

the  pdf  variable  x.  The  CF  is  the  inverse  Fourier  transform  of  a 

pdf,  and  therefore  CFs  retain  all  the  properties  of  Fourier  transforms. 

Hence,  we  use  the  same  transform  on  the  objective  function  Rx  in 
(13),  and  then  apply  Parseval’s  identity.  The  inverse  Fourier  transform 

of  Rx  in  (13)  is  denoted  Lx  and  given  by 


+00 
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r  n 

Lx(v)  =  Rx(X  +  x)dVTxd~x  =  e-nP.rWr\  ~jxr 


Using  these  definitions,  we  can  apply  Parseval’s  identity  over  each 
variable  in  (13)  to  express  the  conditional  performance  index  as  an 
integral  over  the  spectral  variable  V, 
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IV.  The  Conditional  Performance  Index 
Consider  the  integral  over  vr  in  (15), 

,°°  ( ITil  X 
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where  Er  is  the  r1*1  column  of  the  D-dimensional  identity  matrix.  The 
^is  denoted  as  fxk\z  ^he  unnormalized  cpdf 


xk\zk  =  fxk\  zk  '  fzk\  where  />  is  the  pdf  of 
the  measurement  historyana  has  aKlicwn  value.  In  [1T,14],  the  CF  of 

the  uepdf  cpn  lk(v)  is  recursively  propagated;  the-CF  of  the  normalized 
cpdf  is  (pk\Av)  =  cp  (v)/f  ,  where/7  =  (p  -  (v)  \  V=Q.  From 
k\k  zk  zk  xk\zk 

[13, 14]  the  fonn  of  the  CF  of  the  uepdf  at  time  k  is  shown  to  be 
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k\k  k\k  k\k  k\k  k\k  k\k 

and  the  parameters  nt  ,  ne  i ,  qi£  ,  pi£  ,  al£  ,  b ,  are  generated 

sequentially  from  k  =  1. 

For  the  MPC  algorithm,  the  CF  of  the  uepdf  is  to  be  propagated 
through  the  stochastic  dynamics  (7)  to  time  k  +  m  =  p  using 
the  propagation  formula  given  in  [13,17].  The  CF  of  the  m-step 

p\k 

propagated  cpdf  is  denoted  (p  ( v)  and  given  by 
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In  (18)  we  effectively  add  m  terms  to  the  sum  in  /Jfi  v)  of  (17c). 
By  combining  the  exponent  in  (16)  with  that  in  (18),  the  combined 
exponent  in  the  integrand  of  (16)  has  in  total  nej  +m  +  n  real  terms, 
while  the  imaginary  part  is  composed  of  two  components.  Define  the 
following  tenns 
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and 
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qt£  =0  for  R=nei  +  1, . . . ,  hei 
Using  these  definitions,  the  integrand  in  (16)  becomes 
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where  L*  is  the  complex  conjugate  of  Lx.  The  next  section  shows 
how  to  evaluate  these  n  nested  integrals  sequentially  in  closed  form. 
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The  integration  in  (16)  is  perfonned|  for  eacljiVl^qig^  qj"  l^irj^oHi. 
Beginning  with  Vn,  decompose  v  =  v  vn 
Then, 
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V.  The  Conditional  Performance  Index  for  a  Second 
Order  System 

Now,  let  us  limit  our  discussion  to  a  second  order  system,  i.e., 
n  =  2,  in  order  to  use  the  structure  for  the  cpdfs  CF  presented 

in  [16].  This  alternate,  two  state  structure  for  the  CF  of  the  cpdf 
takes  advantage  of  relationships  not  yet  generalized  to  the  general 

vector-state  case  that  drastically  reduces  the  number  of  terms  needed 

to  express  .the  CF.  In  particular,  there  is  a  simpler  structure  for  the 
exponential  argument  m  (17c),  as  well  as  a  simpler,  closed  torn 

k\k 

representation  for  the  g.  coefficient  functions  in  (17a). 

The  structure  for  the  CF  for  the  ucpdf  is  given  by 
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The  objective  is  to  reduce  the  inner  integral  in  (21)  to  a  form  that 


is  obtained  in  closed  form  using  the  integral  formula  developed  in 
-  k\k 

[13, 14],  First,  since  aj£  multiplies  V  in  the  sign  function  in  (20b) 


and  the  absolute  value,  function  jnfc(20c),  they  are  decomposed  as 
--k\k  _  \k\k  „ A| A  ,  where  a  ,  .  k\k  n- 1 

«/f' 


i£  is  a  scalar  and  ai£  E  R 


Therefore,  the  inner  products  in  (20b)  and  (20c)  become 


aklk 


k 

V  =  a  j£  ’ 


V  -  -afj^Vn 


(22) 


In  order  to  rewrite  (22)  in  a  fonn  consistent  with  the  integral 
formula  in  [13,14],  —  a/£|  is  factored  out  of  the  second  tenn.  If 
~^k=  0,  then  the  tenn  e  -v  loses  dependence  on  vn  and  it  is 
removed  from  the  inner  integral  in  (21).  Therefore,  only  ak£k  /= 


0 


\ 


(23a) 


needs  to  b^  considered.  Let  (22)  be  rewritten  as 

a*1"  _  VsgnC  hk^  k  k_ 

i£  *  y  —  3,-^  dj£  Hi£  Vn 

~J</k 

k\k  **  -  a 

where  pi£  =  -  ^k-k ,  v  .  Therefore,  the  elements  in  (20b)  and 

(20c)  are  **  ,  \ 


qk]k  V.k\k  XV  _k|k  (  klk 
i£  sgn  p,£  ,v  \  =  qi£  sgn  pi£  ■ 

-/c|/c  _k\k  ^  k\k  k\k 

/£ 


(23b) 


Vn 


f*i£  ’V\=  Pi£  ki£  ~  Vn  ,  (23c) 

where  q^k  =  q^sgn  —  ~ak^k  and  pk^k  =  pk^k  ak^k  .  Using  these 
/£  /£  /£  i£  /£  /£ 

definitions,  the  inner  integral  of  (21)  is  restated  as 

□  C 

r°°  nk^k  ( 

qji(v)dvn  =  e* -^lk’VxX°°  k\ktf*  _k\k  '  k\k  \ 
pD  q,£  sgn  pi£  ~vnLih 

—  OO  — oo  £—1 

□_  ^  □ 
xexp  pk\k  k\k  k\k 

□  ,£  /£  -  Vn  +  jb i  VnD  dvn.  (24) 

£=1 


which  consists  of  a  sum  of  Nk\k  similar  terms.  By  using  the 

efficient  algorithm  presented  in  [  1 7]  instead  of  the  general  vector-state 

algorithm  from  [13],  the  number  of  terms  needed  to  express  the  cdpf  s 

CF.  is  dramatically  reduced  (see  [17]  for. an  explicit  comparison), 
which  also  significantly  improves  the  efficiency  of  the  controller. 

Each  of  these  tenns  is  a  product  of  a  coefficient  function  G'k\k^ 
and  an  exponential  function  whose  argument  involves  a  sum  of 

absolute  values  equivalent  to  (17c).  In  the  exponential  argument: 
there  is  a  sum  of  L  absolute  value  terms;  the  P  and  Zr  values 

represent  real,  scalar  constants  where  P  >  0;  the  B  's  are  1x2 

i  k\k 

row  vectors,  called  the  fundamental  directions;  and  the  /W£  represent 
integers  that  index  the  fundamental  directions  that  multiply  v.  The 
parameters  for  the  two-state  estimator  are  constructed  recursively  in 
the  measurement  update  process,  as  described  in  [16].  All  of  these 


parameters  correspond  with  those  of  the  structure  for  the  cpdfs  CF 
presented  in  Section  IV.  The  P£  and  Z£  parameters  correspond  to 

k\k  k\k  '  '  M. 

pi£  and  b,  ,  respectively,  and  the  fundamental  directions  B  k^k  are 
the  same  as  the  ak J*  vectors. 

The  coefficient  functions  G^k (v)  equal  the  gk^k  from  (17a),  and 
are  rational  functions  of  polynomials  of  sums  of  sign  functions,  given 
by 

(  \  ( 

'  fl/f  x 


_  i  f  +  b  sgn  B  (  v  sgn  R  j  \ 

~  iftrfY' ^  ‘  k\k 

(  .  \  (  , 

+7'C/sgn  Btfi  v  +  yd, sgn  \ 


.  Vi  2  v 


'k\k  ’ 

1 


j& 


k-r  +  Y  +  Sk_r(Bk\kv)  1 

-  1  .  ,  (26a) 

jG'k_r-Y+  Sp_r(Bk\kv) 


where 


The  convolution  integral  in  (24)  is  shown  in  [13, 14]  to  have  a  closed 
fonn  solution  composed  as  a  sum  with  h  ^  terms,  each  of  which 
is  structurally  similar  to  the  tenns  in  v) .  That  is,  there  will  be  a 
new  g  function  which  is  a  function  of  signs  of  inner  products  of  v . 

Therefore,  this  integration  process  can  be  repeated  until  all  of  the 
integrals  are  taken,  and  a  closed  form  solution  of  the  conditional 
perfonnance  index  is  detennined.  For  a  vector  state  system  of  general 
order  n,  a  closed  form  analytical  solution  is  not  attainable  due  to  the 
complexity  of  the  coefficient  functions  gl  .  However,  a  closed  form 
expression  for  the  conditional  performance  index  can  be  obtained  for 
a  two-state  system,  presented  next,  by  using  the  structure  from  [17]. 


Lk-jrlk-r  ( 

Si-r(Bk]kv)  =  ♦  P.£ 

'  k-r\k-r\-\ 

£=1 


& 


sgn  v  ,  (26b) 

2 


1  M'  T  -1  Hvf/*  u 

-r  =  Z(k-r)  -  Z,Bk_r]k_r H  ZiBk-r\k-rH 


(26c) 


The  arguments  ^^(v)  of  gk^k,  given  in  (17b),  correspond  to  the 
_/  gi  i 

•=,k-r\k-r(Bk\kv)  defined  above. 

The  state  propagation  uses  the  same  formula  as  in  (18)  and  [13, 
17],  This  process  affects  the  exponential  argument  of  <f>  k\k  by  adding 
new  terms  to  the  real  part  and  a  transformation  on  the  fundamental 
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directions  BM‘ The  coefficient  functions  remain  unchanged  as 

polynomials  of  sign  ftmetions,  but  the  arguments  of  the  sign  functions 
have  the  same  transformation  as  in  the  exponential  argument,  so  that 
/  /  mT 

Gp\i<(v)  =  v)-  The  state  propagation  does  not  add  any 

new  terms  to  the  sum,  so  that  Np\k  =  Nk\k-  Hence,  the  m-step  state 
propagated  cpdf  s  CF  is  given  by 


Denote  the  Ith  inner  integral  with  respect  to  V2  in  (31)  as  /;, 2' 


jOO 


1  p\kM  = 


&p\k(v)exp 


/=  1 


£=\ 


£  M' 

Pi  Bplkv  +j 


£  M  j  '  □ 

Z/BP\k  vn. 


£=1 


(27) 


Using  this  expression  for  the  unnonnalized  CF,  we  can  derive  an 
expression  for  the  performance  index  in  closed  form.  The  transformed 
objective  function  in  the  performance  index  (14)  is  given  by 


L*  =  e-r)P,\  Wi  I  +  jx M  -  rip, 2 \v2\  +  jx2v2 


(28) 


where,  since  k  will  be  a  constant  through  this  process,  the  time 
subscript  of  x(p)  is  replaced  with  element  subscripts  as  x(p)  = 
[if,  X2]1.  The  2-state  specific  version  of  (15)  then  becomes 


7  =  M 

Zk  (2tt)2 


JOO 


u 


x(p)(v  }v  ?  ■  (p  \v  \  V 2)  dvlv 2 


u 


—  00 

|CO 


-  1 2rrn 


2f~ 

Zk 


exp ( — /7i  kil  -H2  \v2  \  +jxiVi  +jx2v2 ) 


-<s 


W 


/;>2  -  GP\k(v\,  v2)  x  exp 


1  £ 


£=l 


M  M 

P'  Bk,\v  1+  Bk2v2 


( \ 

r)2  0  +  1/2  +  j  ~  Z£  cm,  v2+jx2vi}  dv2.  (32) 
-  |  |  'Bk,  2  □ 

£=1 

The  integral  //, 2  is  over  the  V2  variable,  but  it  also  contains  1/1  in 
the  absolute  value  terms  in  the  argument  of  the  exponential  and  in 
the  coefficient  function  G‘p]k-  In  order  to  solve  it,  we  need  to  use 
the  integral  of  absolute  values  method  presented  in  [13, 14, 16].  This 
method  involves  defining  a  set  of  scalar  constants  p£  and  as  well 
as  scalar  variables  p£  that  depend  on  V\,  and  thus  are  constants  in 
this  integration. 

\f  =  0,  then  when  this  constant  multiplies  V2  the  variable 
disappears,  and  that  term  comes  out  of  the  integral.  This  will  affect 
the  specific  form  of  second  integral  over  V\ ,  but  the  method  presented 
here  can  still  be  used.  However,  for  simplicity  of  presentation,  in  this 
derivation  we  assume  that  all  theD^2  =  0.  Then,  we  can  construct 

a  ^p  £}f\lk  set  that  transforms  //2into  the  integral  of  absolute 
values  structure.  This  assumption  does  not  affect  generality,  because 
if  one  of  the  equals  zero  for  some  /,  then  that  absolute  value 
term  would  not  be  a  function  of  V2  and  would  come  out  of  lk2  and 
integrated  later. 

The  set  of  variables,  denoted  {p£},  is  constructed  as 

n  □ 

—BWsV  1 

M  M  M  K,l  1 


Bki  Vi  +  Bk  l  v2  =  -Bk 


□ 


^k,2 


-  V2U 


=  —Bk2'(p£  -  V2) 


(33a) 


/=1  Gp\k(v)  exp  l-  £=1  Pi  B 


P\k  1 


+j'Zi  Z5BM*  'viidv. 


The.  integral  with  respect  to  Vo  is  now  taken.  WerUse  a  second 
subscript  to  denote  the  individual  elements  of  the  M  row  of  B 

_  _  O  x  _  A  A  :  n  I  U 


!  B 


=  B 


part 


M 

o1^  till 


At,  1  .  /c,2 

e  exponential  argument  as 


£m/' 

/  £=l  Z  6PII<  V=J 


£=1 


\  ( 

£  M;  \ 

Ziak,  1  1/1+7 


\ 


£=1 


ZiB 


M; 

k,  2 


and  rewrite  J  as 

zk 


V2, 

(30) 


(p- 1 


C/tt 

JZk  (2n)2f^ 

y&ik  00  (  ..  .2 

x  ‘  exp  ~h  ki  I  +jxiVi  +j  ( 


/=! 


Z£iB^Xv' 


□ 

□ 


klk 


£=1 


M’ 


Gp\k(v) expo-  £  ^  Pi  Bp\k v 


■12  \V2\ 


( 

+ j  Z£B1' 


v  +jxv 

£=1  i  k,2 


dv  dv  . 

2 


where 


(29) 


p£  = 


Ref } 

B  *4  p\k 

\  R  =  L‘ 


(33b) 


P\k 


+  1. 


Similarly,. for  the  argument  of  the  exponential,  we  can  construct  a  set 

.r  »  .  7  ,  w 


p|* 

BMi  .  This  allows  us  to  decompose  the  complex 


of 


(f+1 


p£  = 


pf-  M, 

~B  k,2 

12 


,  P\k} 


R  =  L1 ..  +  l 

P\k  T 


and  a  scalar  number  as 
/ 

f/c  =  X2  + 


£=\ 


£  M 

ZiB  A. 


(33c) 


(33d) 


The  solution  to  an  integral  of  an  exponent  of  absolute  values 
requires  dividing  the  domain  of  integration  into  regions  in  which  the 
integrand  is  continuous.  Since  '  {v)  ispiecewise-constant,  itsdis- 


continuities  lie  on  the  boundaries  of  these  regions,  and  hence  G^k(v) 

is  treated  as  a  constant  in  each  integral.  In  order  for  the  Gp^k(v) 
coefficients  to  be  consistent  with  the  form  in  (37),  a  procedure  mir¬ 
roring  that  of  the  measurement  update  process  from  [16, 17]  must  be 
used.  This  involves  rewriting  the  first  bracket  Jgrm'in  (26(q)  a  sum 

ofisign  functions,  using  the  identity  sgn  nk\kV  s®n  Rk\k 

V  (  \  /  \x 

sgn  v  +  Sgn  v  v  2  —  1  and  the  substitutions 

2  k\(k 


V  = 


k\k 


( 


(3D 

2  2O 


Pi  =  c,sgn  —B  A  p2  =  diSgn  B  2\  p£  =  0  for  R  >  2, 

16:  16 


2  1 


Pi  =  1, 

~P2  =  1, 

p£  =  0  for  R  >2. 

(34) 
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Furthermore,  for  the  rest  of  the  bracket  terns  in  (26a)  we  can  rewrite 

CO  „ 

the  sums  given  by  Sk-r  in  (26b)  by  defining  constants  Pf  as 


(0 


p£ 


=  '  B¥rr\k-rHT  sgn^S^  R  1’" 


R=  L' 

p\k 


-  r  +  1, . . 


in  order  to  write  G^k  in  (32)  as 


Gfakiv  -  H  t CT)  = 


L&lk+1  _  LpM+i  . 

/  U  psgn(p  _  cr) ,  ^  psgn(p  —a),... 

Gp\k  D  £  £  £  £ 


£=1 


£=1 


(Z-p/k-2)P£Sgn  (pr  -  a) (Dpfsgn  (p£  -  a) 


£=1 


£=1 


□ 


•  B..^fc+1  n 

:=  G'  P  sgn(p  -cr)  .  (36) 

pik  n  £  £ 

£=1 


where 


,L‘  -r 

p\k 

U  +  1 

(35) 


□ 


□ 


u  1  1 


r-  L-l-“  ^  “  ,  (□ 

Gp|/<  ±p  +  v  p  sgn(p  -p  )  = 

□  m  £  £  m  — 

£=l 
£/=m 


□ 


L'  +  1 


,  C+  .  \2 

3/  +  D;  ±prn  +  p£Sgn(fj£  ~  Pm)  ~  1 


\ 


2  £=1 
£/=m 


□ 


□  □  □ 

+  j  J  ±Pm  +  p£ Sgll  (p£  -  pm)  D 

0An 


L*'k  21  . 

n 


2n 


p=  i 


jQk—r  +  Y  +  S'k_r(^nPm>  p£  Pm) 


W - —  y  +  S' — \±vpm,  p£ 


k—r  k—r 


fJm) 


and  for  re{l,...,L’  -2}, 

k\k 


(38b) 


Here,  the  variable  p£  is  a  shorthand  variable  representing  all  of  the 
[np£,  p£,  and  p£  constants  in  all  of  the  sums  in  G'p ^ 

Then,  to  write  this  integral  in  a  form  consistent  with  the  mea¬ 
surement  update  formula,  let  CT  :=  Vi  in  order  to  write  the  one¬ 
dimensional  integral  as 


ICO  ; 

//,  2  =  G1 


p\k  L 


*-p//c+l 


£=\ 


□ 

P£Sgn(p  -cr)- 
£ 


□  .  +1  □ 

J_l&k  r-| 

x  exp  -  p  |p  -a\+j?a  da.  (37) 

□  £  £  k  □ 

£=1 


S'k-r(±[npm,  P£  ~  Pm) 

=  ±pmKBM r  H?' 

k-r\k-r 

(  \  (  \ 

n  £■  /—y  Mi  ■  ■  T  M.-  / 

u-r/k-rP  B  H  sgn  sgn  (p  p 

irk 

~B„ 


+  •  k—r\k—r 

£=1 


'k,2 


£  m 


£/^m 
V  hAm 

-±pr  s!V,hT\ 

'  <  u  ,x  eoif 

+  P.  s,'rl,_,H  Sg„n-B„(n-  + 

£/=m  Bk,l  k’2 

x  sgn  (t/|) .  (38c) 


The  solution  is  given  as  a  sum  of  L1  _|_  ^  terms  as 

p\k 


□ 


Lj$k+l  □  L'p$tl 

lid  Vi)  =  eXP  □  -  P£  |/j£  -  Pm  |  +j?kpm 

m=  1  £=1 

£/=m  □ 


□ 


£r+l 


hFr/  □  -  ■ 

u  Pl*-  +Pm  +  ’  p£  sgn  (p£  -  pm)L 

n  *=£ 


£J= 


- £'  +1 - 

j?  +  Pm  +  ^  ,  , 

£=1  P£  Sgn  (p£  -  pm) 

£/= 


□  k 


□ 


LNk 

Gp\kL-Pm  + 


)’  P£  Sgn  (p£  —  pm)- 


£/=m 


Liplk+l  □  ' 

j?„~pm+  >’  P£Sgn(P£_  Pm)  ^ 


£=1 

£/=m 


(38a) 


In  the  two-state  estimator  [17],  algebraic  relationships  are  used  to 
reduce  the  complicated  bracket  terns  produced  by  the  update  process 
into  simpler,  polynomial  forms.  Similarly,  those  same  algebraic 
relationships  are  used  here  to  simplify  (38a).  Denote  the  coefficient 
of  the  mth  tenn  in  the  sum  as  Gp\k  and  consider  the  numerators  of 

the  coefficient  function  in  the  brackets  in  (38a)  given  by  (38b).  This 
complicated  term  can  be  reduced  to  a  simple  function  of  sgn  ( Ip) 

by  rewriting  it  in  the  form  of  the  left  hand  side  of  (48a),  and 
then  applying  the  algebraic  identity  presented  in  the  Appendix.  This 
process  begins  by  rewriting  the  numerator  functions  (38b)  in  a  four 

parameter  form  involving  only  sgn(lp)  as 


mm  mm 

a i  ±  bi  ■  sgn  (vi)  ±  jd  +  jdi  ■  sgn  ( Vi) .  (39) 


Then,  the  real  part  of  the  exponential  argument  in  (38a)  and  the 
sums  in  the  denominators  of  the  coefficient  function  in  (38b)  can 
be  reduced  to  a  single  constant  times  |  i/i  |  or  sgn  ( t/i),  respectively, 
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where  those  constants  are  defined  as 

^+1 

Di,m\v  i|  = 

£=1 

£/= 


□ 


P£ 


-8^'  □ 


£=1 

£/=m 

%k+1 


~W~  “sf 


TTi- □  |n  |,  (40a) 


D/,m sgn  ( n)  =  P£S8n  <P£  -  Vm) 

£=1 
£/ —m 

□  ^+1  t  ~BVv 

=  □  prsgn - Mm  ^Dsgnti/i). 


£=1 

£/=m 


M  B 


Expressing  the  numerators  of  (38b)  as  (39),  and  using  D,;m,  then  the 

result  in  the  Appendix  can  be  used  to  express  the  coefficient  function 
as  a  two  parameter  form  given  in  (48a),  where  those  parameters  are 
denoted  a,,m  and  d/>m.  For  the  exponential  argument,  use  D/  m  and 
rewrite  the  complex  part  as 

□  ~B  a 

kV\  =  Q&-  MS  -Vi 

/  k,  1 

Dk,  2 

in  order  to  express  (38a)  in  a  much  simpler  fonn  given  by 


(41) 


.  -D,,m  Ini  +y?*n 


V- 


\ 


1  v  v  x2 

a/  +  bi  -  ±  pm+  pfsgn  (p2  -  pi)  -1 

1  £=1 


£/=m  □ 

.□ 


If+  1 


+/  ±P  +  P  sgn (p  -p ) 

□  1  £  2  1  □ 


( 


£=2 


=  a,±fa,sgn  Bm'i  h.  sgn (p  -p) 


/c,2  ’  D/c,2  (  \2  1  (  \ 

±yc,sgn  BM'  +  jd  sgn  BMj  sgn(p  -p) 

t<,2  '  fc,2  2  l 

M 1  AiP  U  M/  Sm/D 

fc,l 

B  - 


(  \ 

=  a,±P/sgn  B  '  .  o  '  sgn 

It, 2  D/(.2 


/c,l 


□  _ 


+ 


-  sgn(n) 


(  M\ 


..2  A<fl 

&Za  t%'  -.,0 


( 


#\  BMl 


k, 2  6m>. 


lc,l 


±yc,sgn  6  '  +yc/sgn  B  '  sgn--  m  .  sgn  ( v, ) . 

*’2  '  fc-2  ww?  nM' 

Op  9  D/f9 

,z  (43) 


The  same  manipulations  can  be  done  for  the  numerators  for  m  =  2 

by  interchanging  pi  and  C,  with  P2  and  di,  respectively.  The  +  and 
—  in  (43)  refer  to  the  left  and  right  numerators,  respectively,  in  (38a) 


(38b)  by  substituting  in  (38c).  This  product  is  given  in  Result  2  from 
the  Appendix  of  [17]  and  produces  a  term  of  the  same  form  as  (43). 
Hence,  this  result  can  be  combined  with  the  next  bracket  tenn  in  the 
product  in  (38c),  eventually  expressing  both  numerators  as  (39). 

For  m  >  2,  this  first  bracket  term  produces  a  simpler  form  due 

to  two.  sign  Junctions  canceling  each,  other  out.  Recalling  that  Qe  = 
p£  =  l)  i or  2,  the  new  term  obtained  for  m  =  3  corresponds  to 

the  coefficient 

(  U  +i 

l  (  W  \ 


\ 


3i  +  bi  -  ±pm+  p£ sgn  (pi  -  p3)  -  1 

2  £=1 

£/=m  D  D 

+  /□  .  U*<+[  □ 

7  Jipi  +  pfsgn  (p2  —  pa)  □ 

£=2 

, ,  (  Mi  2  \ 

(4Ub)  _  a(.+6(Sg£  x  sgn  (pi  -  p3)  sgi  p3) 

sgn  (p  p  )  +jd  sgn  M  sgn  (p  p) 
+  yc,sgn  ,-3  Bk{  2-  3 

B  M}  BM'~  "  B  Bry,; 

=  a/  +  P/sgn  L  —  ^2r+_LsgnL_  ^  +— LL 

BK,2  S/c,2  B/c,2 

+y  C/sgn  L  ^  ^  +  c//sgn  +  %?3r  - 

□  □  - +  —  _  □  -  - □□ 


BMi  Ettl 


k,  2 


X  sgn  ( IA ) .  (44) 


//, 2(V\)  =  {aim  +jdi,msgn(vl)}  ■  e 

m=  1 

(42) 

To  obtain  the  fonn  in  (39)  for  (38b),  a  procedure  mirroring  that 
of  the  estimator  update  process  is  applied,  which  uses  the  algebraic 
relationships  presented  in  the  Appendix  of  [17].  For  m  =  1  and 
m  =  2,  the  numerators  of  (38a)  are  not  equal  and  hence  cannot 
come  out  of  the  bracket  term.  Then,  the  numerators  for  m  =  1  are 
given  by 


Now,  substitute  (42)  into  (31)  and  denote  the  outer  integral  with 
respect  to  Vi  by 


L'puk+1  t°° 


/ i,  1  —  {di,m  "by  d/',mSgn  ( V\ )} 

r^=1  -CO  » (.  \ 

x  exp  —Dj,m\v- ll  +j?  exp(-p  \v  \+jx  v  ) 


kv  1 


ti  ii 

(.^27£  M,  X  . 

:  exp  J  2  ,-Bkl  dvx 

£=1 


LPlk+l  fbo 

_  ^  {ai,m  +  j  di,m  Sgn  ( 1/! )} 

~  -oo 

C  ^  '  X 

xexp  -  Di.m  +  rp  kil  +j  ^  +  xi+  v>  dvh 

£=1 

(45) 

This  integral  has  a  form  identical  to  the  measurement  update  process 
for  a  scalar  system  [10].  Its  solution  is  given  by 


i,+l 


ai  m  ~ /  di.. 


//,  l  = 


177=  1 


LPlk+ 1 


□  £'+*1  2  Z,B  I  D/,m  +  t?l 

L  -  +£=1  £  fc/1  + 

k 

—  di\rn  4 ' _/  djfm 

-j  i>+x1  +  \ ’z*  J 

k  iBk,\  —  D  +  n 

£=1  4,  I 

^/,177  Di,m  "F  Pi  c//,m  X  T  -  ZjB/yi^ 


£  / 

£=1 

2 

that  are  given  by  (38b). 

Hence,  since  the  first  bracket  tenn  in  (38b)  can  be  expressed  as 
(43),  it  can  be  combined  with  a  bracket  tenn  from  the  product  in 

2 

m=\ 

\  y  l  2  * 

£*  +  *1  +  ’  +  Di,m  +  Hi 

£=1 

(46) 
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Finally,  using  (46)  in  (31)  and  reintroducing  the  time  dependence 
notation,  the  conditional  performance  index  is  given  by 


zk 

□ 

□ 

□ 

□ 


=  1 

2rrf 


L 


\ 


.+i 


Ppj,  Z/TT  ''\'k  Lfyk 

u-(  i)  +  n  . 

Zk  '=k  b/  i=l  m=  1  I 

Di.m  +  rjp,  1  _  di,m  x,(p)  +  Z£g ™f 

£=1  '  ’ 

- 1  - Di.m - 


(47) 


f. 


-Q 


U  <rik  +  *i(p)  +  £=t  +  +  Hp.i 

This  closed  form  conditional  performance  index  was  ob¬ 
served  to  be  non-convex  and  depends  on  the  control  input  se¬ 
quence  {u(k), ... ,  u(p  —  1)}  in  a  complex  way.  Specifically, 

X\(p)  JGfp)  1  =  x(p)  depends  on  the  control  sequence,  along 
with  the  parameters  3;,m  and  d/,m  that  also  depend  on  the  measure¬ 
ment  sequence  z.k-  Therefore,  we  maximize  (47)  numerically.  The 
optimization  is  done  in  two  steps:  first,  the  global  optimum  of  the 
conditional  performance  index,  expressed  as  a  double  sum  in  (47), 
is  optimized  with  respect  to  the  control  input  with  a  coarse  gradient 
search  in  order  to  find  an  approximate  optimal  control  sequence. 
Then,  we  use  the  accelerated  gradient  search  method  [21]  starting 
from  that  approximate  sequence  in  order  to  refine  the  optimal  control 
solution. 


VI.  Numerical  Examples 

Here,  we  present  two  sets  of  examples,  the  first  of  which  shows 
the  optimal  control  versus  the  measurement  for  the  first  time  step 
only,  and  the  second  set  shows  two  multi-step  examples.  All  of  these 
examples  use  a  two-step  horizon,  i.e.  m  =  2,  so  that  there  exists 
a  control  sequence  that  can  drive  our  two-state  system  to  the  origin 
over  this  horizon  length.  However,  as  we  are  using  model  predictive 
control,  only  the  first  control  input  of  this  sequence  is  applied  at  that 
time  step.  Gaussian  parameters  used  for  the  LEG  and  for  Gaussian 
noises  are  closest  in  the  SaS  sense  to  their  corresponding  Cauchy 
distributions.  For  clarity  of  presentation  in  the  figures,  the  first  update 
occurs  at  k  =  0  instead  of  k  =  1.  Furthermore,  since  the  control 
weightings  remain  constant  across  time  steps,  we  will  drop  the  time 

notation  from  the  parameters  p  and  if. 

All  the  simulations  use  the  same  system  dynamics  with  H  =  [  1  1], 
T1  =  [0.5  1],  A1  =  [0.5  1],  m  =  2,  and  the  eigenvalues  of  ®  are 

0.8  ±  0.55/  The  terminal  state  weightings  are  p  =  [1  1],  and  when 
control  weightings  are  used  they  are  <f  =  10.  The  initial  condition’s 

scaling  parameters  are  given  by  a  =  [0.8  0.8],  and  the  process  and 
measurement  noise  parameters  /3  and  y  are  either  0.5  or  0.1. 

All  of  our  examples  compare  our  Cauchy  optimal  model  predictive 
controller  with  a  similar  LEG  model  predictive  optimal  controller  [1], 
Hie  LEG  estimator  assumes  that  the  stochastic  inputs  are  described 
by  the  Gaussian  pdfs  that  are  closest,  in  a  least  squared  sense,  to  the 
given  Cauchy  pdfs.  The  LEG  controller  assumes  that  its  objective 
functions  of  the  state  and  control  resemble  scaled  Gaussian  pdfs  that 
are  closest,  in  the  least-squared  sense,  to  the  scaled  Cauchy  pdfs 
in  (10).  The  LEG  design  details  can  be  found  in  [12-14,16,17], 
where  similar  comparisons  and  least  square  fits  were  used.  The  LEG 
controllers’  responses  are  shown  in  dashed  lines  in  the  figures. 

A.  One-Step  Control  Examples 

The  first  set  of  examples  are  shown  in  Fig.  1 .  These  figures  show 
the  applied  optimal  control  input  at  the  first  time  step  given  the  first 
measurement.  In  the  two  cases  presented,  all  the  systems  parameters 
are  the  same,  except  in  Fig.  1(a)  y  >  P  (i.e.,  more  measurement 
than  state  uncertainty),  and  in  Fig.  1(b)  ft  >  y  (i.e.,  more  state  than 
measurement  uncertainty). 


The  example  in  Fig.  1(a)  shows  that  the  Cauchy  controller  is  nearly 
linear  for  small  measurements  and  reduces  its  control  effort  to  zero 
as  the  measurement  deviations  become  large.  This  is  in  contrast  to 
the  LEG  controller,  which  is  linear  and  thus  responds  strongly  to 

large  measurement  deviations.  This  behavior  in  the  Cauchy  controller 
occurs  when  the  measurement  uncertainty  is  larger  than  the  state 
uncertainty.  In  the  opposite  case  shown  in  Fig.  1(b),  the  measurement 
has  less  uncertainty  than  the  state.  Here,  the  Cauchy  controller’s 
response  closely  matches  that  of  the  LEG  in  a  neighborhood  of  the 
origin,  and  in  fact  responds  even  more  strongly  than  the  LEG  for 
large  measurement  deviations. 

The  three  different  curves  in  both  of  these  figures  depict  the  control 
signals  for  three  different  control  weights:  no  control  weight,  (  =  10, 
and  <f  =  5.  As  expected,  heavier  control  weights  (i.e.  smaller  if) 
reduce  the  control  effort.  Even  without  any  control  weighting,  the 
response  in  Fig.  1(a)  goes  to  zero  for  large  measurement  deviations. 
The  fact  that  this  behavior  is  seen  when  there  is  no  control  weighting 
implies  that  the  attenuation  of  the  control  signal  for  large  measure¬ 
ment  deviations  is  due  to  the  cpdf  and  not  the  objective  function. 
Moreover,  this  behavior  is  not  shared  by  the  LEG  controller  that 
uses  a  similar  objective  function  but  assumes  light-tailed,  Gaussian 
distributions. 


Fig.  1 .  Optimal  control  vs  the  measurement  for  the  first  time  step. 


B.  Multi-Step  Control  Examples 

The  complexity  of  evaluating  the  conditional  performance  index 
grows  as  the  number  of  terms  increases  across  time  steps,  as  indicated 
in  (47).  For  implementable  control,  this  growth  needs  to  be  arrested. 
The  full  information  CF  of  the  ucpdf  (25)  is  approximated  by  a  CF  of 
a  ucpdf  conditioned  on  a  fixed  sliding  windowofthe  most  recent  mea¬ 
surements,  as  described  in  [1 7],  where  the  number  of  measurements 
in  this  horizon  is  denoted  Nz .  Hence,  the  first  Nz  measurement 
updates  are  perfonned  nonnally,  and  the  control  optimization  is 
performed  using  this  CF.  Then,  for  time  steps  k  >  Nz,  we  initialize 
a  new  finite  horizon  ( EH)  estimator  and  perform  Nz  measurement 
updates  over  the  fixed  window  {z(k  —  Nz  +  1 z(k)}.  The 
conditional  perfonnance  index  (15)  is  evaluated  using  this  FH  CF. 
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The  remaining  examples  in  Figs  2,  3,  5,  4,  and  6  are  all  multistep 
examples  over  100  measurements,  and  all  use  a  horizon  length 
of  Nz  =  8.  The  variations  between  the  simulations  are  in  the 
stochastic  parameters,  alternating  which  of  process  or  measurement 
noise  dominates  the  other  and  whether  the  noises  are  Gaussian  or 
Cauchy. 

In  Fig.  2,  there  is  more  uncertainty  in  the  state  process  noise 
than  in  the  measurement  noise.  When  large  measurement  deviations 
occur  (such  as  at  k  =  52),  the  Cauchy  controller’s  effort  is  very 
small  even  though  there  is  no  weighting  on  the  control  inputs.  In 
contrast,  the  LEG  controller  responds  with  a  large  control  effort 
that  drives  the  states  from  their  regulated  state  of  zero.  However, 
when  large  process  noise  inputs  occur,  the  state  deviates  and  the 
Cauchy  controller  applies  a  larger  control  effort  than  the  LEG,  thus 
regulating  the  state  more  effectively.  It  is  interesting  to  compare 
Fig.  2  with  Fig.  3  in  light  of  Fig.  1.  They  suggest  that  when  the 
measurement  noise  density  parameter  dominates  the  process  noise 
density  parameter  in  constructing  the  Cauchy  controller,  the  effect 
of  measurement  outliers  is  mitigated,  while  still  responding  to  state 
deviations  due  to  process  noise. 

On  the  other  hand,  in  Fig.  3  there  is  more  uncertainty  in  the 

state  than  in  the  measurement,  and  the  Cauchy  controller  behaves 
very  much  like  the  linear  LEG  controller.  The  state  trajectories  and 

control  inputs  of  the  Cauchy  and  LEG  controllers  appear  equal,  but 
actually  their  differences  are  much  smaller  than  the  scale  of  the  axis 
and  cannot  be  seen.  Similar  behavior  was  observed  in  Fig.  1(b), 
suggesting  that  when  the  stochastic  parameters  allow  it,  the  Cauchy 
controller  follows  the  measurement  more.  It  responds  in  a  more  linear 
fashion  to  the  measurements,  similar  to  the  performance  of  the  LEG 
controller  in  that  setting. 

no®iO?mVWe^W 

both  the  control  and  state  trajectories  of  the  LEG,  which  is  the 


true  optimal  solution.  This  demonstrates  that  the  Cauchy  controller 
is  robust  under  non-impulsive  noise  environments,  as  it  closely 
approximates  the  true  optimal  solution  given  by  the  LEG. 

Fig.  5  shows  two  examples  using  control  weighting  if  =  10, 
where  otherwise  Fig.  5(a)  has  the  same  parameters  as  Fig.  2,  and 
similarly  with  Figs.  5(b)  and  3.  Applying  control  weighting  appears 
to  slow  down  the  reaction  from  the  Cauchy  controller,  as  seen  in  the 
response  to  the  initial  process  noise  pulses  in  Fig.  2.  In  Fig.  5(a) 
with  control  weighting,  a  one  step  delay  is  observed  in  the  Cauchy 
controller’s  response  before  applying  the  regulating  control  input. 
This  behavior  appears  again  in  Fig.  5(b),  where  the  Cauchy  controller 
is  very  delayed  in  applying  control,  leading  to  much  larger  deviations 
of  the  state  than  the  LEG  allowed.  This  suggests  that  excessive 
control  weighting  can  adversely  affect  the  regulation  performance  of 
the  Cauchy  controller  in  response  to  process  noise  pulses.  However, 
in  a  situation  where  the  measurement  noise  dominates,  the  control 
weighting  haS  a  much  smaller  effect,  and  improves  performance  in 
ignoring  measurement  outliers. 

The  final  set  of  results  in  Fig.  6  show  situations  where  either 
measurement  or  process  noise  is  Cauchy  and  the  other  is  Gaussian. 
In  Fig.  6(a)  the  measurement  noise  is  Cauchy  and  the  process  noise  is 
Gaussian.  Hence,  there  are  no  large  state  deviations,  and  the  Cauchy 
controller  perfonns  similarly  to  the  example  in  Fig.  2,  effectively 
ignoring  the  measurement  deviations.  In  Fig.  6(b),  the  measurement 
noise  is  Gaussian  and  the  process  noise  is  Cauchy,  and  the  Cauchy 
controller  perfonns  similarly  to  the  example  in  Fig.  3.  Here,  the  large 
deviations  occur  in  the  state,  but  there  are  no  large  measurement 
deviations.  The  Cauchy  controller’s  performance  closely  matches  that 
of  the  LEG  in  this  case,  except  for  the  early  time  steps. 


VII.  Conclusions 

An  optimal  stochastic  controller  was  derived  for  vector-state,  linear, 
discrete-time  systems  with  additive  process  and  measurement  Cauchy 
distributed  noises.  Since  the  Cauchy  distribution  has  an  undefined 
mean  and  an  infinite  second  moment,  we  cannot  use  standard 
objective  functions,  e.g.,  the  expected  value  of  a  quadratic  function 
of  state  and  control  variables.  Therefore,  a  new  and  computable 
objective  function  was  defined.  As  opposed  to  previous  work,  the 
CF  of  the  cpdf  of  the  state  given  the  measurement  history  and  the 
Parseval’s  identity  are  used  to  express  the  conditional  performance 
index  in  a  closed  fonn.  This  closed-form  conditional  performance 
index  is  optimized  numerically  using  an  accelerated  gradient  search. 
Examples  are  presented  that  show  how  the  proposed  vector  state 
Cauchy  controller  compares  against  an  equivalent  LEG  controller, 
demonstrating  the  Cauchy  controller’s  performance  and  improved 
robustness  over  its  Gaussian  counterpart. 

Appendix 

The  term  given  by 

am  +  bm  ■  sgn  ( \/i)  +  jcm  +  jdm  ■  sgn  ( t/[) 

/ _ / _ v  / _ / _ 

je  +  pm  +  D  ■  sgn  (V\) 
aT  -  bF  ■  sgn  (t/i)  -  jCjm+  jdim  sgn  ( Vi) 

j6  -  pm  +  D  ■  sgn  ( V] ) 

~  &i,m  +  jdim  '  sgn(tq)  (48a) 

where  the  parameters  a,;m  and  d,;m  are  generated  from  the  other 
parameters  by  the  linear  relationships 

“■"-i  9  i  8  4#»  t  4 

di,m 


-D  pm  0  -0 

dr" 

(48b) 

where 

a  =  D2  -p2  -  02,  b  =  -78y, 

m 

A  =  tt  ■  a2  +  b2  . 

(48c) 
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(a)  Full  view. 


Fig.  2.  Cauchy  and  LEG  controllers’  performance  when  the  measurement 
noise  dominates  the  process  noise,  y  =  0.5  and  fi  =  0.1,  and  without  control 
weighting. 


Fig.  5.  Cauchy  and  LEG  controllers’  performance  with  control  weighting, 

<T  =  [10  10]. 


Fig.  6.  Cauchy  and  LEG  controllers’  performance  against  mixed  Cauchy  and 
Gaussian  noises. 


